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Technical Field 

The present invention relates generally to the structure of the ligand 
binding domain of PPARa, and more particularly to the structure of the ligand 
binding domain of PPARa in complex with a ligand. The invention further 
relates to methods by which modulators and ligands of PPARa and other 
PPARs can be identified. 
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PCR 
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Abbreviations 
adenosine triphosphate 
adenosine diphosphate 
bovine serum albumin 
complementary DNA 
DNA binding domain 
dimethyl sulfoxide 
deoxyribonucleic acid 
dithiothreitol 

ethylenediaminetetraacetic acid 

N-2-Hydroxyethylpiperazine-N'-2-ethanesulfonic 

acid 

kilodalton(s) 

ligand binding domain 

mouse peroxisome proliferator activated receptor 
nucleotide diphosphate 
nucleotide triphosphate 
polyacrylamide gel electrophoresis 
polymerase chain reaction 
isoelectric point 

peroxisome proliferator-activated receptor 
peroxide proliferator-activated receptor alpha 
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PPRE 

rPPAR 

RXR 

SDS 

SDS-PAGE 



PPAR response element 

rat peroxisome proliferates activated receptor 

retinoid X receptor 

sodium dodecyl sulfate 

sodium dodecyl sulfate polyacrylamide gel 
electrophoresis 



Amino Acid Abbreviations 



Single-Letter Code 


Three-Letter Code 


Name 


A 


Ala ' 


Alanine 


V 


Val 


Valine' 


L 


Leu 


Leucine 




lie 


Isoleucihe 


p o ■ " 


Pro 


Proline 




Phe ' 


Phenylalanine 


W 


Trp 


Tryptophan 


M 


Met 


Methionine 


G 


Gly 


Glycine 


S 


Ser 


Serine 


T 


Thr 


Threonine 


C 


Cys 


Cysteine 


Y 


Tyr 


Tyrosine 


N 


Asn 


Asparagine 


Q 


Gin 


Glutamine 


D 


Asp 


Aspartic Acid 


E 


Glu 


Glutamic Acid 


K 


Lys 


Lysine 


R 


Arg 


Arginine 


H 


His 


Histidine 



(0 2003 Copyright Derwent Information Ltd. 



WO 02/064632 PCT7US02/03278 

4 

-3- 



Functionally Equivalent Codons 





Amino Acid 






Codons 




Alanine 


Ala 


A 


GCA GCC GCG GCU 


5 


Cysteine 


Cys 


C 


UGC UGU 




Aspartic Acid 


Asp 


D 


GAC GAL) 




Glumatic acid 


GIu 


E 


GAA GAG 




Phenylalanine 


Phe 


F 


uuc uuu 




Glycine 


Gly 


G 


GGA GGC GGG GGU 


10 


Histidine 


His 


H 


CAC CAU 




Isoleucine 


lie 


I 


AUA AUC AUU 


• 


Lysine 


Lys 


K 


AAA AAG 




Methionine - 


Met 


M 


AUG 




Asparagine 


Asn 


N 


AAC AAU 


15 


Proline. 


Pro 


P 


CGA CCC CCG CCU 




Glutamine 


Gin 


Q 


CAA CAG 




Threonine 


Thr 


T 


ACAACCACG ACU 




vaune 


vai 


V 


GUA bUt/ laUo oUU 




Tryptophan 


Trp 


W 


UGG 


20 


Tyrosine 


Tyr 


Y 


UAC UAU 




Leucine 


Leu 


L 


UUA UUG CUA CUC 
CUG CUU 




Arginine 


Arg 


R 


AGA AGG CGA CGC 
CGG CGU 


25 


Serine 


Ser 


S 


ACG AGU UCA UCC 
UCG UCU 



Background Art 

Nuclear receptors reside in either the cytoplasm or nucleus of 
30 eukaryotic cells and represent a superfamily of proteins that specifically bind a 
physiologically relevant small molecule, such as a hormone or vitamin. As a 
result of a molecule binding to a nuclear receptor, the nuclear receptor 
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changes the ability of a cell to transcribe DNA, i.e. nuclear receptors modulate 
the -transcription of DNA However, they can also have transcription 
independent actions. 

Unlike integral membrane receptors and membrane-associated 
5 receptors, nuclear, receptors reside in either the cytoplasm or nucleus of 
eukaryotic cells. Thus, nuclear receptors comprise a class of intracellular, 
soluble, ligand-regulated transcription factors. Nuclear receptors include but 
are not j limited „ to receptors for glucocorticoids, androgens, mineralcorticoids, 
progestins, estrogens, thyroid hormones, vitamin D, retinoids, icosanoids and 

10 pertinently, peroxisome proliferators. Many nuclear receptors, identified by 
either, sequence homology to known receptors (See, e.g. , Drewes et al. , 
(1996) MoL QelL Biol 16:925-31) or based on their affinity for specific DNA 
binding sites in gene promoters (See, e.g. , Sladek et al. , Genes Dev. 4:2353- 
65), have Mna§certained,)igands and are therefore termed "orphan receptors". 

15 Perpxisomes are organelles that are involved in the 0-oxidation of long- 

chain fatty acids and the catabolism of cholesterol to bile, acids (See, e.g. , 
Vamecq & Draye , (1989) Essays Biochem. 24: 115-225). Peroxisome 
proliferators are a structurally diverse group of compounds which, when 
administered to rodents, elioit dramatic increases in the size and number of 

20 hepatic and renal peroxisomes, as well as concomitant increases in the 
capacity of peroxisomes to metabolize fatty acids via increased expression of 
the enzymes required for the p-oxidation cycle (Lazarow & Fujikl , (1985) Ann. 
J /*ev. Cell B/pA.1: 489-530; Vamecq & Draye , (1989) Essays Biochem. 24: 

115-225; and Nelali et al. , (1988) Cancer Res. 48: 5316-5324). Chemicals of 

25 this group include the fibrate class of hypolipidemic drugs, herbicides, 
phthalate plasticizers, unsaturated fatty acids, and jeukotriene antagonists 
(reviewed in Green , (1992) Biochem. Pharmacol. 43: 393-401). Peroxisome 
proliferation can also be elicited by dietary or physiological factors such as a 
high-fat diet and cold acclimatization. 

30 Peroxisome proliferator activated receptors (PPARs) are activated by 

one of a group of compounds known as peroxisome proliferators. Insight into 
the mechanism by which peroxisome proliferators exert their pleiotropic 
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effects was provided by the identification of a member of the nuclear hormone 
receptor superfamily activated by the chemicals described above (Isseman & 
Green, (1990) Nature 347: 645-50). This receptor, termed peroxisome 
proliferator activated receptor alpha (PPARa), was subsequently shown to be 
5 activated by a variety of medium and long-chain fatty acids. PPARa was also 
shown to modulate expression of a variety of genes containing one or more 
PPAR responsive elements found in their promoter regions. 

It appears that PPARa has a role in the regulation of virtually the entire 
oxidative pathway of fatty acids and their derivatives (See , Lemberger et al. , 

10 (1996) Ann. Rev. Cell. Dev. BioL 12: 335-63). It has also been observed that 
PPARa expression is closely tied to conditions that induce elevated 
glucocorticoid levels such as fasting, diurnal rhythm (Lemberger et al. , (1 996) 
J. BioL Chem. 271 : 1 764-69) and stress. " r ^ " 

Structurally, PPARa comprises 1 three functional domains, the N 

15 terminus region, the DNA binding domain and the ligahd binding domain. 
These domains retain their functional autonomy when they are expressed as 
a chimeric or fusion protein (GSttlicher et al. , (1992) Proc. Natl. Acad. Sci. 
U.S.A. 89: 4653-57). 

PPARa activates transcription by binding to DNA sequence elements, 

20 termed PPAR response elements (PPRE), as heterodimers (dimerization is 
essential for the activity of PPARs) with the retinoid X receptors (RXR) (See , 
Keller & Whali , (1993) Trends Endocrin. Met 4: 291-96), which are 
themselves activated by 9-cis retinoic acid (See , Kliewer et al. , (1992) Nature 
358: 771-74; Gearing et ah , (1993) Proc. Natl. Acad. Sci. U.S.A. 90: 1440-44; 

25 Keller et al. , (1993) Proc. Natl. Acad. Sci. U.S.A. 90: 2160-2164; Heyman et 
aL, (1992) Cell 68: 397^06 and Levin et at. , (1992) Nature 355: 359-61). 
Since the PPARa-RXR complex can be activated by peroxisome proliferators 
and/or 9-cis retinoic acid, the retinoid and fatty acid signaling pathways are 
seen to converge in modulating lipid metabolism. 

30 PPREs have been identified in the enhancers of a number of genes 

that encode proteins that regulate lipid metabolism, including: (1) the three 
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enzymes required for peroxisomal p-oxidation of fatty acids; (2) medium-chain 
acyl-CoA dehydrogenase, a key enzyme in mitochondrial p-oxidation; arid (3) 
aP2, a lipid* binding protein expressed 'exclusively in adipocytes; Thus, the 
nature of the PPAR target genes coupled with the activation of PPARs by fatty 
5 acids and hypolipidemic drugs suggests a physiological role for the PPARs in 
a variety of physiological phenomena, including lipid homeostasis (See, e.g. , 
Keller & Whali , (1993) Trends Endocriri. Met 4: 291-96). PPARs have also 
been ^ implicated ^in . glucose- homeostasis, disorders and in atherosclerosis. 
. These ^conditions -may exist alone or together in a complex -phenotype of 

10 metabolic disorders known as syndrome X. " . >v 

Since the discovery of PPARa, additional subtypes of PPAR have been 
identified ,~;e:g. PPARy and PPAR5, which are spatially differentially expressed. 
Because there^ are ; several subtypes of PPAR, it is 'desirable to identify 
compounds -that* are capable of Selectively interacting with only one of - the 

15 PPAR subtypes, notably PPARa. Compounds capable of interlacing with 
PPARa exclusivi3ly would find a wide variety of applications,! for example, 'in 
the prevention of obesity, for the treatment of diabetes, arid other deleterious 
conditions, as noted above. Development of such compounds, however, has 
been hinderecTby a lack of detailed- structural information on the ligarid 

20 binding domain of PPARa and particularly by a lack of structural information 
on the conformation of the ligand binding domain of PPARa as it binds a 
modulating compound. 

It is believed that PPARa regulates some of the same genes as PPARy 
and PPAR5! However, some genes might be upregulated by one PPAR and 

25 downregulated by another PPAR. Up- or down-regulation of certain genes by 
a PPARy agonist might cause detrimental side effects, it might be possible to 
use a PPARa of PPAR5 agonist, partial agonist or antagonist to down- or up- 
regulate (respectively) these same genes, and thereby reduce the detrimental 
side-effects. More generally, it might be possible to individually up- and 

30 down-regulate specific genes to achieve a specific therapeutic goal by 
administering a PPAR activator (or partial activator) that activates (or 
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deactivates) each PPAR to the appropriate extent. Design .or discovery of 
such a compound would be greatly facilitated by three-dimensional structures 
for each of the three target receptors, PPARa, PPARy and PPAR5. 

Polypeptides, including the ligand binding domain of PPARa, have a 
5 three-dimensional structure determined by the primary amino acid sequence 
and the environment surrounding the polypeptide. This three-dimensional 
structure establishes the polypeptide's activity, stability, binding affinity, 
binding specificity, and other biochemical attributes. Thus, knowledge of a 
protein's three-dimensional structure can provide much guidance in designing 

1 0 agents that mimic, inhibit, or improve its biological activity. 

The three-dimensional structure of a polypeptide can be determined in 
a number of ways. Many of the* most precise methods employ X-ray 
crystallography (See, e.g. . Van Holde , (1971) Physical Biochemistry , Prentice- 
Hail, N. J., 221-39). This technique relies on the ability of crystalline lattices to 

1 5 diffract X-rays or other forms of radiation. Diffraction experiments suitable for 
determining the three-dimensional structure of macromolecules typically 
require high-quality crystals. Unfortunately, such crystals have been 
unavailable for the ligand binding domain of PPARa, as well as many other 
proteins of interest. Thus, high-quality diffracting crystals of the ligand binding 

20 domain of PPARa in complex with a ligand would greatly assist in the 
elucidation of its three-dimensional structure. 

Clearly, the solved crystal structure of the PPARa ligand binding 
domain polypeptide would be useful in the design of .modulators of activity 

mediated by all of the PPARs. Evaluation of the available sequence data has 

i 

25 made it clear that PPARa shares significant sequence homology with the 
other PPARs. Further, PPARa shows structural homology with the three- 
dimensional fold of the other PPARs. Thus, in theory, it might be considered 
feasible to design modulators of PPARa based exclusively on the sequence 
and three-dimensional fold of a different PPAR, for example, PPARy. This 

30 method, however, would likely be unproductive and certainly hindered by a 
lack of subtle structural details of the various binding sites and pertinent 
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residues of PPARa involved in the binding event. A solved crystal structure 
would provide these structural detaiis. " ' " 

The solved PPARa-ligand "crystal structure would provide structural 
details and" insights necessary to design^ a 'modulator of PPARa that 
5 maximizes preferred requirements for any modulator, i.e. potency and 
specificity; : By exploiting the stmcturardetails obtained from a PPAR-Jigand 
crystal" structure, it would be possible to design a PPAR modulator that, 
despite PPARa's : similarity with other PPARsrexploits'the unique structural 
features of PPARd! 'A PPAR modiilator developed using stiiicture-assisted 

10 design would ! take advantage of heretofore ; unkhown PPAR stnjctural 
considerations and thus; be more effective than a modulator developed using 
homold^y-based design. 'Potential v or existent homology models cannot 
provide the necessary 'degree of specificity^ 'A PPAR Vnbdulator designed 
using Iti&S&iifahi* tibdftfinit^ PPARa would' also 

15 provide a starting point for the development of modulators of other PPARs. 

- What is heeded, therefore, is : a crystallized form of a PPARa ligand 
binding domain, 1 preferably in complex with a ligatid. Acquisition 61 crystals of 
the PPARa-ligahd binding domain (LBD) polypeptide will permit the thr^e 
dimensional structure : of PPARa LBD polypeptide to be determined. 

20 Knowledge of the three dimensional structure will facilitate the design of 
modulators of PPARa activity. Such modulators can lead to therapeutic 
compounds to treat a wide range of conditions, including lipid homeostasis 
disorders, glucose homeostasis disorders, inflammation, atherosclerosis and 
syndrome X. 

25 

Summary of the Invention 
A substantially pure PPARa ligand binding domain polypeptide in 
crystalline form is disclosed. Preferably, the crystalline form has lattice 
constants of a = 61 .3 A, b = 103.5 A, c = 49.9 A, a = 90°, p = 90°, y = 90° or 
30 lattice constants of a = 95.58 A, b = 122.06 A, c = 122.10 A, a = 90°, p = 90°, 
y = 90°. Preferably, the crystalline form is an orthorhombic crystalline form. 



(0 2003 Copyright Derwent Information Ltd. 



WO 02/064632 



PCTAJS02/03278 



-9- 

Mo re preferably, the crystalline form has a space group of P2 1 2 1 2 or space 
group P2i2i2l Even more preferably, the PPARa ligand binding domain 
polypeptide has the amino acid sequence shown in SEQ ID NO: 4. Even 
more preferably, the PPARa ligand binding domain has a crystalline structure 
5 further characterized by the coordinates corresponding to Table 2. 

Preferably, the PPARa ligand binding domain polypeptide is in complex 
with a ligand. Optionally, the crystalline form contains one or four PPARa 
ligand binding domain polypeptides in the asymmetric unit. Preferably, the 
crystalline form is such that the three-dimensional structure of the crystallized 

10 PPARa ligand binding domain polypeptide can be determined to a resolution 
of about 1,8 A or better. Even more preferably, the crystalline form contains 
one or more atoms having a molecular weight of 40 grams/mol or greater. 

A method for determining the three-dimensional structure of .a 
crystallized PPARa ligand binding domain polypeptide to a resolution of about 

15 1.8 A or better is disclosed. The method comprises (a) crystallizing a PPARa 
ligand binding domain polypeptide; and (b). analyzing^ the PPARa ligand 
binding domain polypeptide to determine the three-dimensional structure of 
the crystallized PPARa ligand binding domain polypeptide, whereby the three- 
dimensional structure of a crystallized PPARa ligand binding domain 

20 polypeptide is determined to a resolution of about 1 .8 A or better. Preferably, 
the analyzing is by X-ray diffraction. More preferably, the crystallization is 
accomplished by the hanging drop vapor diffusion method, and wherein the 
PPARa ligand binding domain is mixed with an equal volume of reservoir. 
Even more preferably, the reservoir comprises 4-8% PEG 3350, 100-200mM 

25 NaF, and 12-16% 2,5 hexanediol or the reservoir comprises 50 mM bis-tris- 
propane, 4-6% PEG 3350, 150 mM NaNOa, 16% 2,5 hexanediol, and 1-3 mM 
YCI 3 . 

A method of designing a modulator of a PPAR polypeptide is disclosed. 
The method comprises (a) designing a potential modulator of a PPAR 
30 polypeptide that will form bonds with amino acids in a substrate binding site 
based upon a crystalline structure of a PPARa ligand binding domain 



(0 2003 Copyright Derwent Information Ltd. 



WO 02/064632 



PCT/US02/03278 



polypeptide; (b) synthesizing the modulator, and (c) determining whether the 
potential modulator modulates the activity of the PPAR polypeptide, whereby 
a modulator of a PPAR polypeptide is designed. • ?- 

■ : 'A method of designing a modulator that selectively modulates the 
5 activity of a PPAR polypeptide is disclosed. The method comprises (a) 
obtaining a crystalline form of a PPARa ligand binding domain polypeptide; 
(b) evaluating the three-dimensional Structure of the crystallized PPARa 
ligand binding domain polypeptide; and (c) synthesizing a potential modulator 
based on. the three-dimensional crystal structure of the crystallized PPARa 

10 ligand binding domain polypeptide, whereby a modulator that selectively 
modulates the activity of a PPARa polypeptide is designed; Preferably/ the 
method further * comprises contacting a PPARa ligand binding domain 
polypeptide with the potential modulator, and assaying the PPARa ligand 
binding domain polypeptide" for binding of the potential modulator; for a 

1 5 change in activity 6f the PPARa ligand binding domain polypeptide/ or both. 
More preferably, the- crystalline formisin orthorhombic form. Even more 
preferably, the crystals aire such . that the three-dimensional structure of the 
crystallized PPARa ligand binding domain polypeptide can be determined to a 
resolution of about 1 .8 A or better. 1 

20 A method of screening a plurality of compounds for a modulator of a 

PPAR ligand binding domain polypeptide is disclosed. The method comprises 
(a) providing a library of test samples; (b) contacting a crystalline PPARa 
ligand bincling domain polypeptide with each test sample; (c) detecting an 
interaction between a test sample and the crystalline PPARa ligand binding 

25 domain polypeptide; (d) identifying a test sample that interacts with the 
crystalline PPARa ligand binding domain polypeptide; and (e) isolating a test 
sample that interacts with the crystalline PPARa ligand binding domain 
polypeptide, whereby a plurality of compounds is screened for a modulator of 
a PPAR ligand binding domain polypeptide. Preferably, the test samples are 

30 bound to a substrate, and more preferably, the test samples are synthesized 
directly on a substrate. 
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A method for identifying a PPAR modulator is disclosed. The method 
comprises (a) providing atomic coordinates of a PPARa ligand binding 
domain to a computerized modeling system; and (b) modeling ligands that fit 
spatially into the binding pocket of the PPARa ligand binding domain to 
5 thereby identify a PPAR modulator. Preferably, the method further comprises 
identifying in an assay for PPAR-mediated activity a modeled ligand that 
increases or decreases the activity of the PPAR. 

A method of identifying a PPARa modulator that selectively modulates 
the activity of a PPARa polypeptide compared to other polypeptides is 

10 disclosed. The method comprises (a) providing atomic coordinates of a 
PPARa ligand binding domain to. a computerized modeling system; and (b) 
modeling a ligand that fits into the binding pocket of a PPARa ligand binding 
domain and that interacts with conformational^, constrained residues of a 
PPARa conserved among PPAR subtypes to thereby identify a PPARa 

15 modulator. Preferably, the method further comprises identifying in a biological 
assay for PPARa activity a modeled ligand that selectively binds to PPARa 
and increases or decreases the activity of said PPARa. 

A method of designing a modulator of a PPAR polypeptide is disclosed. 
The method comprises (a) selecting a candidate PPAR ligand; (b) determining 

20 which amino acid or amino acids of a PPAR polypeptide interact with the 
ligand using a three-dimensional model of a crystallized protein comprising a 
PPARa LBD; (c) identifying in a biological assay for PPAR activity a degree to 
which the ligand modulates the activity of the PPAR polypeptide; (d) selecting 
a chemical modification of the ligand wherein the interaction between the 

25 amino acids of the PPAR polypeptide and the ligand is predicted to be 
modulated by the chemical modification; (e) performing the chemical 
modification on the ligand to form a modified ligand; (f) contacting the 
modified ligand with the PPAR polypeptide; (g) identifying in a biological 
assay for PPAR activity a degree to which the modified ligand modulates the 

30 biological activity of the PPAR polypeptide; and (h) comparing the biological 
activity of the PPAR polypeptide in the presence of modified ligand with the 
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biological activity of the PPAR .polypeptide in the presence of the unmodified 
ligand, whereby a modulator of a PPAR polypeptide is designed. Preferably, 
the. PPAR polypeptide is a PPARa polypeptide. More preferably; the three- 
dimensional model of a crystallized protein is a PPARa LBD polypeptide with 

5 a bound ligand. Optionally, the method further comprises repeating steps (a) 
through (f), if the biological activity of the PPAR polypeptide in the presence of 
the modified ligand varies from the biological activity of the PPAR polypeptide 
in the presence of the unmodified ligand. 

An assay method for identifying a compound that inhibits binding of a 

10 ligand to a' PPAR poly peptide is disclosed. The assay method comprises (a) 
} incubating a PPAR polypeptide with a ligand in the presence of a test inhibitor 

compound; (b) determining an amount of ligand; that is bound to the PPAR 
polypeptide,: wherein decreased binding of ligand to the PPAR protein in the 
presence of the ^tesitvinhibitor , compound ^relative to binding of ligand^ in . the 

15 absence of the test inhibitor compound is indicative of inhibition; ( and . (c) 
identifying <the :test .cpmpound as an inhibitor of Jigand. binding if. decreased 
ligand binding is observed.^ t/tsJ 

: A method of identifying a PPAR modulator that selectively modulates 
the biological activity of one PPAR subtype compared to PPARa is disclosed. 

20 The .method comprises: (a) providing an atomic structure coordinate set 
describing a PPARa ligand binding domain structure and at least one other 
atomic structure coordinate set describing a PPAR ligand binding domain, 
) each ligand binding domain comprising a ligand binding site; (b) cpmparing 

the. PPAR atomic structure coordinate sets to identify at least one diference 

25 between the sets; (c) designing a candidate ligand predicted to interact with 
the difference of step (b); (d) synthesizing the candidate ligand; and (e) testing 
the synthesized candidate ligand for an ability to selectively modulate a PPAR 
subtype as compared to PPARa, whereby a PPAR modulator that selectively 
modulates the biological activity of one PPAR subtype compared to PPARa is 

30 identified. 

Accordingly, it is an object of the present invention to provide a three 
dimensional structure of the ligand binding domain of PPARa. The object is 
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achieved in whole or in part by the present invention. 

An object of the invention having been stated hereinabove, other 
objects will be evident as the description proceeds, when taken in connection 
with the accompanying Drawings and Laboratory Examples as best described 
5 hereinbelow. 



Brief Description of the Drawings 
10 Figure 1 is a ribbon diagram depicting the PPARa LBD in complex with . 

Compound 1. The PPARa LBD is presented as a ribbon diagram and ) 
Compound 1 is presented as a spacefilling model. 

T ' Figure 2 is a schematic drawing depicting interactions between PPARa 
and Compound 1. Residues that lie within 5.5A of heavy atoms in the ligand 
15 are shown. 

Figure 3 depicts overlayed ball and stick diagrams of PPARa, PPARy 
and PPAR8. PPARa is shown in dark gray, PPARy in medium gray and 
PPARS in light gray. PPAR8 was used as a template in the molecular 
replacement solution of PPARa. . 

20 Figure 4 is a ribbon diagram of the PPARa backbone conformation, 

where the ligand binding pocket of PPARa is identified by a smooth solid 
surface. x 

Figure 5 is a ribbon diagram of the PPARy backbone conformation, 
where the ligand binding pocket of PPARy is identified by a smooth solid 

25 surface. 

Figure 6 is a ribbon diagram of the PPAR8 backbone conformation, 
where the ligand binding pocket of PPAR5 is identified by a smooth solid 
surface. 

Figure 7 is a ball-and-stick model depicting hydrogen bonding 
30 interactions between Compound 1 and PPARa. Water molecules are shown 
as octahedral (six-pointed) crosses. 
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Detailed Description of the Invention 
Until disclosure of the present invention presented herein, the ability to 
obtain crystalline forms of a PPARa LBD has not been realized. And until 
5 disclosure of the present invention presented herein, a detailed - three- 
dimensional crystal structure of a PPARa polypeptide has not been solved. 

In addition to providing structural information, crystalline polypeptides 
provide other advantages. For example; the crystallization process itself 
further purifies the polypeptide; and satisfies one of the classical criteria for 
10 homogeneity. In fact, clusterization frequently provides unparalleled 
") purification "quality,' removing -ii^purities' that are h6t 4? refnoved by other 

purification 'methods' such as HPLC, ' dialysis, conventional - column 
cH romatographyi etc. Moreover, crystalline polypeptides are often -stable at 
ambient temperatures and free bf prbtease ? contamination- and 1 other 
15 degradation associated with solution storage. Crystalline polypeptides can 
alsb : *be useful as ' pharmaceutical preparations. • -Finally/ ? c^ 
techniques in general are largely free of problems : such as denaturation 
associated with other stabilize lydphilization). Once 

crystallization 1 has been accomplished, crystallographic data provides useful 
20 structural information that can assist the design of compounds that can serve 
as agbhists or ahtagohists, as described herein below, in addition, the crystal 
structure provides information useful to map a receptor binding domain, which 
) could then be mimicked by a small non-peptide molecule that would serve as 

an antagonist or agonist 



25 



I. Definitions 



Following long-standing patent law convention, the terms "a" and "an" 
mean "one or more" when used in this application, including the claims. 

As used herein, the term "mutation 0 carries its traditional connotation 
30 and means a change, inherited, naturally occurring or introduced, in a nucleic 
acid or polypeptide sequence, and is used in its sense as generally known to 
those of skill in the art. : 
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As used herein, the term "labeled" means the attachment of a moiety, 
capable of detection by spectroscopic, radiologic or other methods, to a probe 
molecule. 

As used herein, the term "target cell" refers to a cell, into which it is 
5 desired to insert a nucleic acid sequence or polypeptide, or to otherwise effect 
a modification from conditions known to be standard in the unmodified cell. A 
nucleic acid sequence introduced into a target cell can be of variable length. 
Additionally, a nucleic acid sequence can enter a target cell as a component 
of a plasmid or other vector or as a naked sequence. 

10 As used herein, the term "transcription" means a cellular process 

involving the interaction of an RNA polymerase with a gene that directs the ) 
expression as RNA of : the fr structural- information present in the coding 
sequences of the gene. Jhe process includes, but is not limited to the 
following steps: (a) . the transcriptiqn initiation, (b) transcript, elongation, (c) 

15 transcript splicing, (d) transcript capping,. c (e) transcript termination,, (f) 
transcript polyadenylation, (g) nuclear export of the transcript, (h) transcript 
editing, and (i) stabilizing the transcript. N , 

As used herein, the term "expression" generally refers to the cellular 
processes by which a biologically active polypeptide is produced. 

20 As used herein, the term "transcriptiqn factor" means a cytoplasmic or 

nuclear protein which binds to such gene, or binds to an RNA transcript of 
such gene, or binds to another protein which binds to such gene orsuch RNA 
transcript or another protein which in turn binds to such gene or such RNA ^ 
transcript, so as to thereby modulate expression of the gene. Such 

25 modulation can additionally be achieved by other mechanisms; the essence of 
"transcription factor for a gene" is that the level of transcription of the gene is 
altered in some way. 

As used herein, the term "hybridization" means the binding of a probe 
molecule, a molecule to which a detectable moiety has been bound, to a 

30 target sample. 

As used herein, the term "detecting" means confirming the presence of 
a target entity by observing the occurrence of a detectable signal, such as a 
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radiologic or spectroscopic signal that will appear exclusively in the presence 
of the target entity. >..v^ . " >v. .-• • 

As used herein; the term "sequencing 1 ' means the determining «the 
ordered Jinear sequence of nucleic acids or amino acids of a DNA or protein 
5 target, sample,-, using conventional manual -or j automated laboratory 
techniques. ; - - - : 1 :.*■ - 

As used herein, the term "isolated" means oligonucleotides 
substantially free of other nucleic acids, proteins, lipids, carbohydrates or 
other materials with which they can be associated, : such association being 

10 either in cellular material or in a synthesis medium. The term can also be 
applied to polypeptides, in which case the polypeptide will be substantially 
free of nucleic acids, carbohydrates, lipids and other undesired polypeptides. 

.;?As>r used r herein;* the term "substantially pure" u ^ means : that^the 
polynucleotide or polypeptide is substantially . free of the* sequences- and 

15 molecules with which it is associated in its natural state, and those molecules 
used in the isolation procedure. The term "substantially free" means that the 
sample is at least 50%, preferably at least 70%, more preferably: 80%: and 
most preferably 90% free of the materials and compounds with which is it 
associated in nature. ^ ; ; ^ • - L ^ 

20 As used herein, the term "primer 11 means a sequence comprising two or 

more deoxyribonucleotides or ribonucleotides, preferably more than three, 
and more preferably more than eight and most preferably at least about 20 
nucleotides of an exonic or < intronic region. Such oligonucleotides are 
preferably between ten and thirty bases in length. 

25 As used herein, the term "gene" is used for simplicity to refer to a 

functional protein, polypeptide or peptide encoding unit. As will be understood 
by those in the art, this functional term includes both genomic sequences and 
cDNA sequences. Preferred embodiments of genomic and cDNA sequences 
are disclosed herein. 

30 As used herein, the term "DNA segment" means a DNA molecule that 

has been isolated free of total genomic DNA of a particular species. In a 
preferred embodiment, a DNA segment encoding a PPARa polypeptide refers 
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to a DNA segment that comprises SEQ ID NOs: 1 and 3, but can optionally 
comprise fewer or additional nucleic acids, yet is isolated away from, or 
purified free from, total genomic DNA of a source species, such as Homo 
sapiens. Included within the term M DNA segment" are DNA segments and 
5 smaller fragments of such segments, and also recombinant vectors, including, 
for example, plasmids, cosmids, phages, viruses, and the like. 

As used herein, the phrase "enhancer-promoter 11 means a composite 
unit that contains both enhancer and promoter elements. An enhancer- 
promoter is operatively linked to a coding sequence that encodes at least one 
10 gene product 

As used herein, the phrase "operatively linked" means that an 1 
enhancer-promoter is connected to a coding sequence in such a way that the 
transcription of that coding sequence is controlled and regulated , by that 
enhancer-promoter. Techniques for operatively linking an enhancer-promoter 

15 to a coding sequence are well known in the art; the precise orientation and 
location relative to a coding sequence of interest is dependent, inter alia, upon 
the specific nature of the enhancer-promoter. 

As used herein, the terms "candidate substance" and "candidate 
compound" are used interchangeably and refer to a substance that is believed 

20 to interact with another moiety, for example a given ligand that is believed to 
interact with a complete, or a fragment of, a PPAR polypeptide, and which can 
be subsequently evaluated for such an interaction. Representative candidate 
substances or compounds include xenobiotics such as drugs and other J 
therapeutic agents, carcinogens and environmental pollutants, natural 

25 products and extracts, as well as endobiotics such as steroids, fatty acids and % 
prostaglandins. Other examples of candidate compounds that can be 
investigated using the methods of the present invention include, but are not 
restricted to, agonists and antagonists of a PPAR polypeptide, toxins and 
venoms, viral epitopes, hormones (e.g., opioid peptides, steroids, etc.), 

30 hormone receptors, peptides, enzymes, enzyme substrates, co-factors, 
lectins, sugars, oligonucleotides or nucleic acids, oligosaccharides, proteins, 
small molecules and monoclonal antibodies. 
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As used herein, the term "biological activity* means any observable 
effect flowing from interaction between a PPAR polypeptide and a ligand. 
Representative, but non-limiting, examples of biological' activity in the context 
of the present invention^ include dimerization J of PPARa with RXR, 
5 phosphorylation; and association of PPARa with DNA. r 

" ; As used herein, the "Wrm a m6difiecl 0 means an alteration from an 
entity's ndrmally occurring' state. r An (entity can be modified by removing 
discrete chemical units' or by adding discrete chemical units. The temri 
"modified" encompasses detectable labels as well as those entities added as 
10 aids in purification. : , . . . ; 

As used herein, the terms "structure coordinates" and "structural 
TOorxiinates n - mean> : mathematical • coordinates derived from mathematical 
equations rdated to the patterns obtain^ 

beam of X : rays by the ;atoms (scattering centers) -of a 'molecule ^iri crystal 
15 form. The diffraction data are used to calculate an electron density map of the 
repeating unit of*the 'crystal. The electron density maps are used to establish 
the positions of the individual atoms within the unit cell.of the crystal. 

: Those of skill in the art understand that a set of structure coordinates 
determined by X-ray crystallography is not without standard error. For the 
20 purpose of this invention, any set of structure coordinates for PPARa or a 
' PPARa mutant that have a root mean square (RMS) de^viatioh of no more 
than i.O A when superimposed, using the polypeptide backbone atoms, on 
the structure coordinates listed in Table 2 shall be considered identical. 

As used herein, the term "space 7 group" means the arrangement of 
25 symmetry elements of a crystal. 

As used herein, the term "molecular replacement" means a method 
that involves generating "a preliminary model of the wild-type PPARa ligand 
binding domain, or a PPARa mutant drystal whose structure coordinates are 
unknown, by orienting and positioning : a molecule whose structure 
30 coordinates are known (e.g., PPARS) within the unit cell of the unknown 
crystal so as best to account for the observed diffraction pattern of the 
unknown crystal. Phases can then be calculated from this model and 
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combined with the observed amplitudes to give an approximate Fourier 
synthesis of the structure whose coordinates are unknown. This, in turn, can 
be subject to any of the several forms of refinement to provide a final, 
accurate structure of the unknown crystal. (Lattman, (1985) Method EnzymoL, 
5 115: 55-77; Rossmann , ed, (1972) The Molecular Replacement Method , 
Gordon & Breach, New York.) Using the structure coordinates of the ligand 
binding domain of PPARa provided by this invention, molecular replacement 
can be used to determine the structure coordinates of a crystalline mutant or 
homologue of the PPARa ligand binding domain, or of a different crystal form 

10 of the PPARa ligand binding domain. 

As used herein, the terms "p-sheet D and "beta-sheet", mean the 
conformation of a polypeptide chain, stretched x into an extended zig-zig 
conformation. Portions of polypeptide chains that run "parallel" all run in the 
same direction. Polypeptide chains that are "antiparallel" run in the opposite 

15 direction from the parallel chains. . ; . . 

As used herein, the terms "a-helix" and "alpha-helix"^ mean the 
conformation of a polypeptide chain wherein the polypeptide backbone Is 
wound around the long axis of the molecule in a left-handed or right-handed 
direction,-and the R groups of the amino acids protrude outward from the 

20 helical backbone, wherein the repeating unit of the structure is a single tumoff 
the helix, which extends about 0.56 nm along the long axis. 

As used herein, the term "unit cell" means a basic parallelipiped 
shaped block. The entire volume of a crystal can be constructed by regular .) 
assembly of such blocks. Each unit cell comprises a complete representation 

25 of the unit of pattern, the repetition of which builds up the crystal. Thus, the 
term "unit cell" means the fundamental portion of a crystal structure that is 
repeated infinitely by translation . in three dimensions. A unit cell is 
characterized by three vectors a, b, and c, not located in one plane, which 
form the edges of a parallelepiped. Angles a, p and y define the angles 

30 between the vectors: .angle a is the angle between vectors b and c; angle p is 
the angle between vectors a and c; and angle y is the angle between vectors 
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a and b. The entire volume of a crystal can be constructed by regular 
assembly of unit cells; each unit cell comprises a, complete representation of 
the unit of pattern, the repetition of which builds_up the crystal. 

• : As used, herein, "prthorhombic unit cell", means a unit cell . wherein 
5 a * b * c; and a = p = ,y = 90V The vectors a, b and c describe the unit cell 
edges and the angles a, p, and y describe the unit cell angles. 

As used herein, the term "crystal lattice" means the array. ,of points 
defined I by the vertices pf packed unit cells.. ... - r ^ r > r . i: ;- :: . .. ;ftf?r:r . 
As used herein, the .term "PPAR". means any polypeptide sequence 

10 that, can be; aligned with at least one of human PPARa,: PPARy T pr^PARS 
such that at least 50% of the amino acids are Identical to the corresponding 
amino acid in the; human PPARa, PPARy or PPAR8. The term TPAR^also 
encompasses, nucleic acid sequences where the corresponding translated 
protein: sequence^ can l becponsidered; to bje a PPAR. The . term /PPAR" 

15 encompasses at least the PPARa, PPARy and PPAR5 ^subtypes. -Thaterm 
U PPAR D includes Invertebrate, homologs; preferably,: PPAR nucleic acids ,and 
polypeptides are isolated frorn eukaryotip sources. "PPAR" further includes 
vertebrate homologs ,of PPAR family members, including, but not limit^d ; 4o, 
mammalian and avian homolog. Representative mammalian homologs of 

20 PPAR family members include, but are not limited to, ^murine and human 
homologs. 

As used herein, the terms "PPARa gene product", "PPARa protein", 
"PPARa polypeptide", and "PPARa peptide" are used interchangeably and 
mean peptides having amino acid sequences which are substantially identical 

25 to native amino acid sequences from the organism of interest and which are 
biologically active in that they comprise all or a part of the amino acid 
sequence of a PPARa. polypeptide, or cross-react with antibodies raised 
against a PPARa polypeptide, or retain all or some of the biological activity 
(e.g., DNA or ligand binding ability and/or dimerization ability) of the native 

30 amino acid sequence or protein. Such biological activity can include 
immunogenicity. 
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In the present invention, the terms "PPARa gene product", "PPARa 
protein", "PPARa polypeptide", and "PPARa peptide" are used 
interchangeably and mean to the preferred subtype of the PPAR family, 
namely PPARa, which comprises the amino acid sequence of SEQ ID NO: 2. 
5 As used herein, the terms "PPARa gene product", "PPARa protein", 

"PPARa polypeptide", and "PPARa peptide" also include analogs of a PPARa 
polypeptide. By "analogs is intended that a DNA or peptide sequence can 
contain alterations relative to the sequences disclosed herein, yet retain all or 
some of the biological activity of those sequences. Analogs can be derived 

10 from genomic nucleotide sequences as are disclosed herein or from other 
organisms, or can be created synthetically. Those skilled in the art will 
appreciate that other analogs, as yet undisclosed or undiscovered, can be 
used to design and/or construct PPARa analogs. There is no need for a 
"PPARa gene product", "PPARa protein", "PPARa polypeptide", or "PPARa 

15 peptide" to comprise all or substantially all of the amino acid sequence of a 
PPARa polypeptide gene product. Shorter or longer- sequences are 
anticipated to be of use in the invention; shorter sequences are herein 
referred to as "segments". Thus, the terms "PPARa gene product", "PPARa 
protein", "PPARa polypeptide", and "PPARa peptide" also include fusion or 

20 recombinant PPARa polypeptides and proteins comprising sequences of the 
present invention. Methods of preparing such proteins are disclosed herein 
and are known in the art. 

As used herein, the term "polypeptide" means any polymer comprising 
any of the 20 protein amino acids, regardless of its size. Although "protein" is 

25 often used in reference to relatively large polypeptides, and "peptide" is often 
used in reference to small polypeptides, usage of these terms in the art 
overlaps and varies. The term "polypeptide" as used herein refers to 
peptides, polypeptides and proteins, unless otherwise noted. As used herein, 
the terms "protein", "polypeptide" and "peptide" are used interchangeably 

30 herein when referring to a gene product. 
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As used herein, the term "modulate 0 means an increase, decrease; or 
other alteration of any or air chemical and biological activities or properties of 
a wild-type or imutant PPAR polypeptide; preferably a wild-type or mutant 
PPARot polypeptide. The term "modulation" as used herein refers to both 
5 upregulatiorv (i.e., activation or stimulation) and downre'gulation (i.e. inhibition 
or suppression) of a response. 

As used herein, the' terms "binding pocket of the PPARa ligand binding 
domain 0 , "PPARa ligand binding pocket* and "PPARa binding pocket" are 
used ihterchahgeaibly; and refer to the teirge cavity within the PPARa ligand 

10 binding domain where Compound 1 binds. This cavity may be empty/or may 
cohtaih water molecules or other mdlebules from the solvent, or may contain 
ligand 'atoms! Tfie "main" binding po^t includies the region of space hot 
occupied by atoms of PPARa that cbmpnses residues lfe-241, l^u-^7, Ala- 
2^6; Glu^Mf lleu^254; Vaj-255! 11^272!," Ph^273, Cys-275, Cys^fiS/ Gln- 

15 277, Thr-279, Ser-280, Tyr-314, lle-317, 4 Ph^18, Leii-32i, Met-330, Val-332, 
^a^333e-33^ Lei^34< lle-354, Met-355; rtis- 4*0, Val-444; Leu-456, Leu- 
460 and Tyr-464. The binding pocket also includes regions of space near 
the "niain" binding pocket that not occupied by atoms of PPARa But that are 
near the "main" binding pocket, and that are contiguous with the "main" 

20 binding pocket. 

As used herein, the terms "PPAR gene" and "recombinant PPAR gene" 
mean a nucleic acid molecule comprising an open reading frame encoding a 
PPAR polypeptide of the present invention, including both exon and 
(optionally) intron sequences. 

25 As used herein, the term "gene" is used for simplicity to refer to a 

functional protein, polypeptide or peptide encoding unit. As will be understood 
by those in the art, this functional term includes both genomic sequences and 
cDNA sequences. Preferred embodiments of genomic and cDNA sequences 
are disclosed herein. 

30 As used herein, the term "DNA sequence encoding a PPAR 

polypeptide" can refer to one or more coding sequences within a particular 
individual. Moreover, certain differences in nucleotide sequences can exist 
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between individual organisms, which are called alleles. It is possible that 
such allelic differences might or might not result in differences in amino acid 
sequence of the encoded polypeptide yet still encode a protein with the same 
biological activity. As is well known, genes for a particular polypeptide can 
5 exist in single or multiple copies within the genome of an individual. Such 
duplicate genes can be identical or can have certain modifications, including 
nucleotide substitutions, additions or deletions, all of which still code for 
polypeptides having substantially the same activity. 

As used herein, the term "intron" means a DNA sequence present in a 

1 0 given gene that is not translated into protein. 

As used herein, the term "interact" means detectable interactions } 
between molecules, such as can be detected using, for example, a yeast two 
hybrid , assay. The term "interact" is also meant to include "binding" 
interactions between molecules. Interactions can, for example, be prptein- 

15 protein or protein-nucleic acid in nature. . 

As used herein, the terms "cells," "host cells" or "recombinant host 
cells" are used interchangeably and mean not only to the particular subject 
cell, but also to the progeny or potential progeny of such a cell. Because 
certain modifications can occur in succeeding generations due to either 

20 mutation or environmental influences, such progeny might not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as 
used herein. 

As used herein, the term "agonist" means an agent that supplements j 
or potentiates the bioactivity of a functional PPAR gene or protein or of a 

25 polypeptide encoded by a gene that is up- or down-regulated by a PPAR 
polypeptide and/or a polypeptide encoded by a gene that contains a PPAR 
binding site or response element in its promoter region. 

As used herein, the term "antagonist" means an agent that decreases 
or inhibits the bioactivity of a functional PPAR gene or protein, or that 

30 supplements or potentiates the bioactivity of a naturally occurring or 
engineered non-functional PPAR gene or protein. Alternatively, an antagonist 
can decrease or inhibit the bioactivity of a functional gene or polypeptide 
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encoded by. a gene that is up- or down-regulated by a PPAR polypeptide 
and/or contains, a PPAR binding site or response element in its: promoter 
region. An antagonist can also supplement or potentiate the bioactivity of a 
naturally occurring . or engineered non-functional gerieor polypeptide encoded 
5 by a -gene i that is up- or down-regulated by a PPAR polypeptide, .and/or 
contains a PPAR binding site or response element in its promoter region. . 

As used herein, the terms "chimeric protein" or -fusion protein" are 
used interchangeably and mean a fusion of a first amino acid sequence 
encoding a PPAR polypeptide with a second amino acid- sequence defining a 

10 polypeptide domain foreign to, and not homologous .with, any domain of a 
^) PPAR polypeptide. A chimeric protein .can include a foreign domain that is 

found- in an organism that also expresses the first protejn, or it can, be an 
"interspecies" or "intergenic" fusion of protein structures expressed -by 
different kinds of organisms. In general, a fusion protein , can be represented 

1 5 by the general foimuja X— PPAR— Y, wherein PP,$R represents a portion of 
the protein which is derived from a PPAR polypeptide, and X and Y are 
independently absent or represent amino acid , sequences which are not 
related to a RPAR sequence in an, organism,, which- includes naturally 
occumng mutants. , ■ v N 

IL Description of Tables 

-Table 1 is a table summarizing the crystal and data statistics obtained 
) from the crystallized ligand binding domain of PPARa. Data on the unit cell 

are presented, including data on the crystal space group, unit cell dimensions, 
25 molecules per asymmetric cell and crystal resolution. 

Table 2 is a table of the atomic structure coordinate data obtained from 
X-ray diffraction from the ligand binding domain of PPARa in complex with a 
ligand. 

Table 3 is a table of the atomic structure coordinate data obtained from 
30 X-ray diffraction from the ligand binding domain : (residues 207-441) of a 
PPAR5 crysta\ (Xu et aL , (1999) Mol Cell 3: 397-403, PDB ID: 1GWX; 
Genbank Accession No. L07592; available online at http://www.rcsb.org/pdb/ ). 
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The coordinate data from the PPAR5 ligand binding domain were used in the 
molecular replacement solution of the PPARa ligand binding domain crystal 
form. 

. Table 4 is a sequence alignment which shows sequence similarities 
5 between the PPARa, PPARy and PPAR8 sequences. The binding site 
residues are denoted by small boxes and represent those* residues lying 
within 5.0 angstroms of the ligand. 

111. General Considerations 

10 The present invention will usually be applicable mutatis mutandis to all 

PPARs, as discussed herein based, in part, on the patterns of PPAR structure 
and modulation that have emerged as a consequence of determining the 
three dimensional structure of PPARa with bound ligand. Analysis and 
alignment of amino acid sequences, and X-ray "and NMR structure 

15 determinations, have shown that nuclear receptors have a modular 
architecture with three main domains: 

1) a variable amino-terminal domain; 

2) a highly conserved DNA-binding domain (DBD); and 

3) a less conserved carboxy-terminal ligand binding domain (LBD). 

20 In addition, nuclear receptors may have linker segments of variable length 
between these major domains. Sequence analysis and X-ray crystallography, 
including the work of the present invention, have confirmed that PPARs also 
have the same general modular architecture, with the same three domains. 
The function of the PPARs in human cells presumably requires all three 

25 domains in a single amino acid sequence. However, the modularity of the 
PPARs permits different domains of each protein to separately accomplish 
certain functions. Some of the functions of a domain within the full-length 
receptor are preserved when that particular domain is isolated from the 
remainder of the protein. Using conventional protein chemistry techniques, a 

30 modular domain can sometimes be separated from the parent protein. Using 
conventional molecular biology techniques, each domain can usually be 
separately expressed with its original function intact or, as discussed herein 
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below, chimeras comprising two different proteins can be constructed, 
wherein the chimeras retain the properties of the individual functional domains 
of the respective nuclear receptors from which the chimeras were generated. 
The amino terminal domain of the PPAR : subtypes is the least 
5 conserved of the three domains. 'This domain is involved in transcriptional 
activation and, in some cases, its uniqueness may dictate selective receptdr- 
DNA binding and activation of target genes by PPAR subtypes. - This domain 
cari display synergistic and antagonistic interactions with the domains of the 
LBD. " ' - " . .. . * 

10 the DNA binding domain has the most highly conserved amino acid 

sequence amongst the PPARs. It typically contains about 70 amino fields that 
fold into- two zinc finger motifs, wherein a zinc atom- coordinates four 
cysteines. The" DBD contains 1 two * peipehdicularly 1 oriented tit-helixes ' that 
extend frarn th^base of the first arid second zinc fingers. L The two zinc fingers 

15 function in concert along with non-zinc finger residues to direct the PPAR to 
specific target 1 site£ 'on DNA and to align receptee 

Various amino acids 1 in ttid DBD influence spacing between two half-sites 
(which usually comprises six nucleotides) for receptor heterodimerization. 
The Optimal spacings facilitate cooperative interactions between'DBDs, and D 

20 box re^idiies are part of the dimerization interface. Other regions of the DBD 
facilitate DNA-proteih and protein-protein interactions required for RXR-PPAR 
heterodimerization; 

The LBD is the second most highly conserved domain in these 
receptors. As its name suggest, the LBD binds ligands. With many nuclear 

25 receptors, including the PPARs, binding of the ligand cari induce a 
conformational change in the LB|D that can, in turn, activate transcription of 
certain target genes. The LBD also participates in other functions, including 
dimerization and nuclear translocation. 

X-ray structures have shown that most nuclear receptor LBDs adopt 

30 the same general folding pattern. This fold consists of 10-12 alpha-helices 
arranged in a bundle; together with several beta-strands, additional alpha- 
helices and linking segments. The major alpha helices and beta-strands 
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have been numbered differently in different publications. This patent will 
follow the numbering scheme of Nolte et al. . (Nolte et al. , (1998) Nature 
395:137-43), where the major alpha-helices and beta-strands are designated 
sequentially through the amino acid sequence as H1 f H2, SI, H2\ H3, H3\ 
5 H4, H5, S2, S3, S4, H6, H7, H8, H9, H10 and HAF. The alpha-helix at the C- 
terminal end, HAF, is also called "helix-AF", u helix-AF2 n or the "AF2 helix 0 . 
Structural studies have shown that most of the alpha-helices and beta-strands 
have the same general position and orientation in all nuclear receptor 
structures, whether ligand is bound or not. However, the AF2 helix has been 

10 found in different positions and orientations relative to the main bundle, 

depending on the presence or absence of the ligand, and also on the ) 
chemical nature of the ligand. These structural studies have suggested that 
many r nuclear receptors share a common mechanism of activation, where 
binding of activating ligands helps to stabilize the AF2 helix in a position and 

15 orientation adjacent to helices-3, -4, and -10, covering an opening to the 
ligand binding site. This position and orientation of the AF2 helix, which will 
be called the "active conformation", creates a binding site for coactivators. 
See, e.g. , Nolte et aL , (1998) Nature 395:137-43; Shiau et al. , (1998) Ce//95: 
927-37. This coactivator binding site has a central lipophilic, pocket that can 

20 accommodate leucine side-chains from coactivators, as well as a "charge- 
clamp" structure consisting primarily of a lysine residue from helix-3 and a 
glutamic acid residue from the AF2 helix. Structural studies have shown that 
coactivator peptides containing the sequence LXXLL (where L is leucine and y 
X can be a different amino acid in different cases) can bind to this coactivator 

25 binding site by making interactions with the charge clamp lysine and glutamic 
acid residues, as well as the central lipophilic region. This coactivator binding 
site is disrupted when the AF2 helix is shifted into other positions and 
orientations. In PPAR-y, activating ligands such as rosiglitazone (BRL49653) 
make a hydrogen bonding interaction with tyrosine-473 in the AF2 helix. 

30 Nolte et al. , (1998) Nature 395:137-43; Gampe et al. , (2000) Mol Cell 5: 545- 
55. This interaction is believed to stabilize the AF2 helix in the active 
conformation, thereby allowing coactivators to bind and thus activating 
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transcription from target genes. With certain antagonist ligahds/ or in the 
absence of any ligand, the AF2 helix may be held' less -tightly In the active 
conformation, or may be free to* adopt other conformation^' This would either 
destabilize or disrupt the coactivator binding site, t thereby reducing ; or 
5 eliminating coactivator binding and transcription from certain target genes. 
Some of the functions of the P PAR protein depend on having the full-length 
amino acid sequence and certain partner molecules, such as coactivators and 
DNA.V However, ^other functions,^ including ligand binding yand ligand- 
deperident conformational .changes, may be observed experimentally using 
1 0 isolated domains, chimeras and mutant molecules. 

As described herein, the LBD of a PPAR^can be expressed, 
crystallized, its three dimensional structure determined with a ligand bound as 
disclosed. in the present invention, and computational methods can be used to 
design Iigands s to te,LBa. i:(a ^ : ; ; ... ^ %>Qv! --. :;£te.i:-omvjini vlt i^r^ov'; s*' T .r::. 

IV. Synthesis of Compound 1 and Intermediates ^ : ^., V; ^ t r ^ 
....... Compound, 1, which.WM PPARa j^BP in the 

present invention, can be conveniently prepared by a general process 
wherein a moiety like (A) , is coupled to an acid (B) using a peptide coupling 
20 reaction or by alkylation of (A) using a suitable' non nucleophilic amine with an 
acid chloride (C). Preferably, R is 1-6 alkyl, which can be hydrolyzed off or, is 
readily hydrolyzable. 

) ' . ? r t 




A preferred synthesis of (A) when Xi is O and X 2 is NH (and R 1 and R 2 



25 are H) is: 
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Note that this synthesis is preferably carried out with the amine where 
the alcohol function is. already alkylated with the acid side chain protected by 
R., For example, when n is1 , Xi is O, X 2 is NH, Y is S, 2 is N, R 1 and are H, 
and R 3 is 4-F 3 C-phenyl : 




Some of the intermediates of type A 'are commercially available while 
others can be synthesized by techniques apparent to a person skilled in the 
art. The synthesis of intermediates of type B is illustrated tfelow.' 

Compound 1 can be made by an alternative method in which 
compounds of formula (D) are reacted with ethyl 2 r brbmo-2 methyl propionate 
to produce an ethyl ester, which may be hydrolyzed to produce the free acid. 




D 



Compounds of formula (D) may be prepared from the reaction between 
compounds of formula (B) and compounds of formula (E) with HOBT / EDC / 
NEts when X 2 is NH or NCH 3 or DIC / DMAP / NEta when X 2 is O. 
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HO 

E 

The invention is further illustrated by the following examples which 
should not be construed as constituting a limitation thereto. 

5 IV.A. Synthesis of Intermediate 1 

0 ' ■ •••-'• •^ - iu*^*^' / 'T-, c !f !:'v" 

Formation of Intermediate 1 follows the procedure described by Stout 
(Stout , (1983) J. /Wed. Chem. 26(6): 808-13). To 4-methoxybenzyl amine 
(25g, 0.18 mol; Aldrich) is added 46% HBr in H 2 0 (106ml, 0.9 mol; Aldrich). 
10 The reaction is refluxed overnight, then the reaction is cooled to 0°C and 
neutralized to pH7 slowly with KOH (s) . The reaction is allowed to stir for -30 
min, then the solid filtered and dried. The solid is redisolved in hot MeOH, 
filtered and the solution cooled to afford 19g (85%) intermediate 1. 



• 15 



IV.B. Synthesis of Intermediate 2 



JL y\ >~ CF 3 . 

20 A solution of ethyl 2-chloroacetoacetate (35.3g, 29.7mL, 0.21 mol) and 

4-(trifluoromethyl)thiobenzamide (44g, 0.21 mol) in EtOH (300mL) is refluxed 
overnight. After cooling to room temperature the solvent is removed in vacuo. 
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The final product (Intermediate 2) is recrystallized from a minimum of MeOH 
to afford 40g (59%) of final product as a white solid. 

IV.C. Synthesis of Intermediate 3 



f-% /r CF > 



To Intermediate 2 (1.84g, 5.8 mmol) in THF is added 1N LiOH (6mL, 6 
mmol) and the reaction stirred at room temperature. After ~3h, the reaction is 
neutralized with 1N HCI, extracted 3 x 100 mL EtOAc, dried over Na 2 S0 4 , ) 
filtered and the solvent removed under vaccum to afford 1.5g (89%) of 
10 Intermediate 3 as a white solid. 

IV. D. Synthesis of Intermediate 4 





n 

HO' 

To intermediate 3 (1g f 7 mmol) in CH 2 CI 2 /DMF (1:1) is added HOBT 
15 (565mg, 4.2 mmol; Aldrich), EDC (800mg f 4.2 mmol; Aldrich) and 
Intermediate 1 (860mg, 7 mmol). The reaction is stirred at room temperature 
for 18h. The solvent is then removed in vacuo, treated with H 2 0 and extracted 
3x 100mL CH 2 CI 2 . The organic phases are then combined and washed with 
1N HCI, dried over Na 2 S04, filtered and evaporated to afford a mixture (A/- 
20 substituted and /V,0-substituted). The mixture is disolved in MeOH and 
treated with 1N NaOH. The reaction is stirred 18h at 50°C. The solvent 
removed in vacuo, dissolved in CH 2 CI 2 , washed with H 2 0, and dried over 
Na 2 S0 4 . The solvent evaporated the residue chromatographed 
(CH 2 CI 2 /MeOH: 99/1) to afford 610mg (47%) of Intermediate 4 as a white 
25 solid. 
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IV.E. , : Synthesis of Intermediate 5 



) 



10 



15 



20 




2-rriethyl-2-[4-{[(4'-m^ 



T ylcarboriyl)amin6]meth^ acid ethyl ester 

1 'To intermediate 4 ^ in DMF (50mL) is added the 

K2CO3 (275mg t 1.99 mmbl) followed by the ethyl 2-iDromo-2 : meth^propanate 
(280|jC l.91 rrimbi; AJdrich) arid the reaction is heateci to 80°6V After 18h /the 
reaction is cooled to room temperature and the solvent removed in vacuo. 
The residue is treated with water (200 mL), extracted 3 x 50mL ChfeCk; dried 
over rsla 2 sd 4l filtered and the solvent removed under vaccum to afford 680mg 



(77%) of Intermediate 5 as a clear oil." 
j\AF. " Synthesis of fcompdund 1 



. • : c ■ -i 




2rhrieftyl-2 r [4-{[(4-methyl-2-[4-tri^ 

ylcarbonyl)amino]methyl}phenoxy]propiqnicacid ■ 
Tp Intermediate 5 (680mg, 1.39 mmol) in MeOH is added 1N NaOH 
(1 .6 mL, 1 .6 mmpl) and the reaction is stirred at 60X. After 18h, the reaction 
is cooled to room temperature and the solvent evaporated. The residue 
treated with 1N HCI, extracted 3 x 20 mL THF and the solvent is removed 
under vacuum to afford 500mg (75%) of £pmpound 1 . : 
V. Production of PPAR Polypeptides . .. 
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The native and mutated PPAR polypeptides, and fragments thereof, of 
the present invention can be chemically synthesized in whole or part using 
techniques that are well-known in the art (See, e.g. , Creighton ; (1983) 
Proteins: Structures and Molecular Principles , W.H. Freeman & Co., New 
5 York, incorporated herein in its entirety). Alternatively, methods which are 
well known to those skilled in the art can be used to construct expression 
vectors containing a partial or the entire native or mutated PPAR polypeptide 
coding sequence and appropriate transcriptfonal/tra^ signals. 
These methods include in vitro recombinant DNA techniques, synthetic 

10 techniques and wo recombination/genetic recombination. See, for 
example, the .techniques described throughout Sambrook et al. , (1989) 
Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, New 
York, and Ausubel et ai. t (1989) Current Protocols in Molecular Biology , 
Greene Publishing Associates and Wiley Interscience, New York, both 

1 5 incorporated herein in their entirety. 

A variety of host-expression vector systems can be utilized to express 
a PPAR coding sequence. These include but are not limited to 
microorganisms such as bacteria transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 

20 containing a PPAR coding sequence; yeast transformed with recombinant 
yeast expression vectors containing a PPAR coding sequence; insect cell 
systems infected with recombinant virus expression vectors (e.g., baculovirus) 
containing a PPAR coding sequence; plant cell systems infected with 
recombinant virus expression vectors (e.g., cauliflower mosaic virus, CaMV; 

25 tobacco mosaic virus, TMV) or transformed with recombinant plasmid 
expression vectors (e.g., Ti plasmid) containing a PPAR coding sequence; or 
animal cell systems. The expression elements of these systems vary in their 
strength and specificities. 

Depending on the host/vector system utilized, any of a number of 

30 suitable transcription and translation elements, including constitutive and 
inducible promoters, can be used in the expression vector. For example, 
when cloning in bacterial systems, inducible promoters such as pL of 
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bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like can be 
used. When cloning in insect cell systems, promoters such as the baculovirus 
polyhedrin promoter can be r used. When donirig Iri'plant ck\\ ' systems, 
promoters derived ifrom the genome of plant cellsi such as heat shock 
5 promoters; the promoter for the small subunit of RUBISCOV'tKe promoter for 
the chlbrdphyll f a/b binding protein) or from plant viruses (e.g., the 35S RNA 
promoter of CaMV; the coat protein promoter of TMV) can be used. When 
cloning in mammalian cell systems; promoters derived from the genome of 
mammalian cells (e.g.^ metallothionein promoter) or from mammalian viruses 
10 (e.g., the adenovirus late promoter, the vaccinia virus 7.5K promoter) ? can be 
y usedi li{ When generating cell lines that bontain multiple copies of the tyrosine 
kinase dbmaih DNA; SV40-, 6PV^ and EBV-based vectors can be used with 
an'apprbpriate seleHtable marker/ 1 ' " t%,v ' : ** ii%,tm 

15 Vlv ^ Formation 'of PPARd Lidarid Binding Donvain Crystals 

In one embodiment, the present invention provides T crystals of 
PPARd LBD: The crystals were obtained usiing the methodblpgy disclosed in 
the'^^niples./" The PPARa LBD crystals! which can be native crystals, 
derivative crystals or co-crystais, have orthorhorhBic unit cells (an 
20 orthorhombic unit cell is a unit ceil wherein a * b * c, and wherein a = p = y = 
90°) and space group symmetry P2i2i2. There is one PPARa LBD molecule 
in the asymmetric unit In this PPARa crystalline form, the unit cell has 
dimensions of a= 61.3 A, b= 103.5 A, c= 49.9 A, and a = p = y = 90°. This 
crystal form can be formed in a crystallization reservoir comprising 4-8% PEG 
25 3350, 100-200mM NaF, and 12-16% 2,5 hexanediol. 

In another PPARa crystal form, the unit cell has dimensions of a = 
95.58 A, b = 122.06 A, c = 122,10 A, a = P = y = 90° and belongs to the space 
group P2i2 1 2 1 . There are four PPARa LBD molecules in the asymmetric unit 
of this crystal form. . This crystal form can be formed in a crystallization 
30 reservoir comprising 50 mM bis-tris-propane, 4-6% PEG 3350, 150 mM 
NaN0 3 , 16% 2,5 hexanediol, and 1-3 mMYCI 3 . 



J 
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VI -A . Preparation of PPAR Crystals 

The native and derivative co-crystals, and fragments thereof, disclosed 
in the present invention can be obtained by a variety of techniques, including 
5 batch, liquid bridge, dialysis, vapor diffusion and hanging drop methods (See, 
e.g. , McPherson , (1982) Preparation and Analysis of Protein Crystals , John 
Wiley, New York.; McPherson , (1990) Eur. J. Biochem. ,189:1-23.; Weber , 
(1991)y\ch/. Protein Chem. 41:1-36). In a preferred embodiment, the vapor 
diffusion and hanging drop methods are used for the crystallization of PPAR 

10 polypeptides and fragments thereof. 

, In general, native crystals of the present invention are grown by 
dissolving substantially pure PPAR polypeptide or a fragment thereof in an 
aqueous buffer containing a precipitant at a concentration just below that 
necessary to precipitate the protein. Water is removed by controlled 

1 5 evaporation to produce precipitating conditions, which are maintained until 
crystal growth ceases. 

In a preferred embodiment of the invention, native crystals are grown 
by vapor diffusion (See, e.g. , McPherson , (1982) Preparation and Analysis of 
Protein Crystals , John Wiley, New York.; McPherson , (1990) Ear. J. Biochem. 

20 189:1-23). In this method, the polypeptide/precipitant solution is allowed to 
equilibrate in a closed container with a larger aqueous reservoir having a 
precipitant concentration optimal for producing crystals. Generally, less than 
about 25 iiL of PPAR polypeptide solution is mixed with an equal volume of 
reservoir solution, giving a precipitant concentration about half that required 

25 for crystallization. This solution is suspended as a droplet underneath a 
coverslip, which is sealed onto the top of the reservoir. The sealed container 
is allowed to stand, until crystals grow. Crystals generally form within two to 
six weeks, and are suitable for data collection within approximately seven to 
ten weeks. Of course, those of skill in the art will recognize that the above- 

30 described crystallization procedures and conditions can be varied. 

VLB. Preparation of Derivative Crystals 
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* : Derivative crystals of the present invention, e.g. heavy atom derivative 
crystals, vcan be J obtained by soaking native crystals in mother liquor 
•containing salts of heavy Imetal atoms. Such derivative crystals are useful for 
iphase analysis in the solution of crystals of the present invention, in a 
5 preferred embodiment of the present invention, for example, soaking a native 
crystal in a solution containing methyl-mercury chloride provides' derivative 
crystals suitable for use as isomorphous replacements in determining ^the X- 
ray crystal structure of a PPAR polypeptide. Additional reagents useful for the 
.preparation; of ;tha derivative crystals of the present invention will be apparent 
1 0 to those of skill in the art after review of the disclosure of the present invention 
) presented herein. 

VLC. Preparation of Co-crystals - . a- ^ 

u ;, : Cp-cry§tals r pf the ^present invention can^bej obtained - by soaking a 
15 native ^ crystal vin mother liquor containing compounds known or predicted to 
.. bind: the LBD; of, a FJPAR, or a fragment thereof. Alternatively, co-crystals .can 
be obtained by;Cp-cry§tallizjng a PRARLBD polypeptide or a : fragment thereof 
in the presence, of pne or more compounds known , or predicted to bind the 
polypeptide. In a. preferred embodiment, such a compound is Compound. 1. ; 

VI, P. , Solving a Crystal Structure of the Present Invention 
. i Crystal structures of, the , present invention can be solved using a 
variety of techniques including, but not limited to, isomorphous replacement, 
anomalous. scattering or molecular replacement methods; Computer software 

25 packages will also be helpful in solving a crystal structure of the present 
invention. Applicable .software packages include but are not limited to X- 
PLOR™ program (Brunger , (1 992) X-PLOR, Version 3. 1. A System for X-ray 
Crystallography and NMR, Yale University Press, New Haven, Connecticut; 
X-PLOR is available from Molecular Simulations,. Inc., San Diego, California), 

30 Xtal View (McRee , (1 992) J Mol, Graphics 10: 44-47; X-tal View is available 
from the San Diego Supercomputer Center). SHELXS 97 (Sheldrick (1990) 
Acta Cryst. A46: 467; SHELX 97 is available from the Institute of Inorganic 
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: Chemistry, Georg-August-Universitat, Gottingen, . Germany), HEAVY 
(Terwilliger, Los Alamos National Laboratory) and . SHAKE-AND-BAKE 
(Hauptman , (1997) Curr. Opin.. Struct Biol. 7: 672-80; Weeks et.al. , (1993) 
Acta Cryst D49: .179; available from the Hauptman-Woodward Medical 
5 Research Institute, Buffalo, New York) can be used. See also , Ducruix & 
Geige, (1992) Crystallization of Nucleic Adds and Proteins: A Practical 
Approach , 1RL Press, Oxford, England, and references cited therein. 

VII. Characteri zation and Solution of a PPARa Liaand Binding Domain 
10 Crystal .. : . i .. 

VH«A Unique Structural Differences Between PPARa and Other 

ppars -r;*c* >::r- 5 T>?rr 

i-The. PPARa LBD-ligand structure was solved here' using molecular 

1 5 replacement techniques. The overall folding of the protein backbone, and the 
binding mode of Compound i , iare shown in the ribbon diagram of Figure 1. 
Specific interactions between Compound 1 1 and the protein are shown 
schematically in Figure 2. The structure of the PPARy LBD was solved 
previously 1 in the apb form, i.e., with no ligand,- and also with a 

20 thiazolidinedione ligand (rosiglitazone) and a coactivator peptide (NolteetaK, 
(1998) Nature 395:137-43). The apo structure has also been determined 
independently (Uppenberg et al. , (1998) J. Biol. Chem. 273: 31108-12). In 
addition, the structure of the PPARy LBD has been determined with a partial ) 
agonist, GW0072 (Oberfield et al. , (1999) Proa Nat Acad. Set 96: 6102-106). 

25 The structure of the PPARy LBD has also been determined in the 
heterodimeric complex with RXRa, together with coactivator peptides, with 
rosiglitazone, and also with a carboxylic acid ligand, GI262570 (Gampe et at. , 
(2000) Mot Cell 5: 545-55). The structure of the PPAR8 LBD has been 
determined in- the apo form, and with eicosapentaenoic acid, and with the 

30 synthetic compound GW2433 (XuetaL, (1999) Mot Cell 3: 397-403). 
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To facilitate comparison, the structures were first translated and rotated 
into -a -common 'position- and orientation^ as shown in Figure ;1; This 
superimposition operation was done with the- MVP program (Lambert , (1997) 
in Practical Application of Computer-Aided Drug Design , (Charifspn, ed.), pp. 
5 243-303, Marcel-Dekker, New York) ' by ' first ^- alignihg -the amino acid 
sequences to identify corresponding residues in the three different PPARs, 
and: then rotating and translating so as "to superimpose cbitesporidfhg Ga'dr 
backbone atoms from the aligned residues. 1 With 1 this translation -and rotation, 
most of the major alpha-helices arid beta-strands of the three PPARs are 

10 closely superimposed. 'The AF2 helix is well superimposed in 
.) structures^ but is shifted into a different position in ^certain suburiits ''of 

structures of PPARy either in the absence of ligand (apo), or in the presence 
of the partial agonist GW0072 - This sh^ the'PPAR 
AF2 heliX can shift otit of the Active iconfeiWafion^and that it is mbre* likely to 

15 do so in the absence of a strongly (activating Kgaridi in ttne r 4bdve Histed PPAR 
X-ray -Mructtires; the ligahds that sbf : as^s^^ [ (rosiflittkzbne, 

GI262570, eicosapentaenoic acid, GW2433, Compound 1) are all oriented 
with the acid group near the- AF2 helixrand^they all make a hydrogen bond 
with a tyrosine residue in the AF2 Helix (Tyr464 in PPARa, Tyr473 in PPARy, 

20 Tyr437 in PP&RS)." The partial agonist; GW007i2^ also has a carboxyl group, 
but it is oriented differently, and fails to make any hydrogdn bond with the AF2 
tyrosine. This suggests that the hydrogen bond between the ligand and the 
3 AF2 tyrosine helps" to hold the AF2 helix in the active conformation arid 

thereby facilitate boactivator binding: See , Xu tet al: , (1 999) Moll Cell 3: 397- 

25 403 and Oberfieid et al. , (1 999) Proc. Nat Acad. Set. 96: 61 02-1 06. 

To facilitate discussion, it is usetul to establish a nomenclature for 
regions of the ligand arid the ligand binding pocket. The strongly activating 
ligands in these X-ray structures all have an acid group that binds near the 
AF2 tyrosine; and a lipophilic "tail" that extends into a lipophilic pocket. The 

30 acid group can be called thb acid "headgroup y arid the site near the AF2 helix 
into which it binds can be called the "headgrdup binding site". In rosiglitazorie, 
GI262570 and Compound 1, the lipophilic tail is directed into a lipophilic 
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pocket delineated by helix-2\ helix-3 and the beta sheet This will be called 
the "lower tail pocket" or 'lower pocket". In the structure of PPAR5 bound to 
eicosapentaenpic acid, the eicosapentaenoic acid was found in two different 
binding modes with roughly equal occupancy (Xu et al. t (1999) Mol. Cell 3: 
5 397-403). In both binding modes, the acid headgroup bound in the 
headgroup binding site near Tyr 437. In one binding mode, the lipophilic tail 
was directed into the lower tail pocket. However, in the other binding mode, 
the lipophilic tail was directed into a different pocket, delineated by helix-3 T 
helix-5, the beta sheet and the loop between helix-1 and helix-2. This will be 

10 called the "upper tail pocket." GW2433 has a branched tail, and was found to 

bind with one branch In the lower pocket, and with the other branch Jn the ) 
upper pocket (Xu et ah , (1999) Mol. Cell 3: 397-403). GI262570 has an 
additional lipophilic benzophenone group near the acid headgroup. This 
benzophenone group was . found -to be % directed into : a lipophilic : pocket 

15 delineated by helix-3, heIix-7, helix-1 0 and the loop between helix-1 0 and the 
AF2 helix. This will be. subsequently .referred to as the "benzophenone 

pOCket." ; , .' • : rt- ; - ■ . V . 

Comparing the backbone structures, there are substantial differences 
between PPARa, PPARy and PPAR8 in the "loop" region between helix-2 and 

20 strand-1, the loop region between helix-2' and helix-3, and the loop region 
between helix-1 0 and the AF2 helix. There are also smaller differences in the 
loop region between helix-1 .and helix-2, and the loop region between helix-6 
and helix-7. Helices 2, 2' and 3' are closely superimposed, but have slightly ) 
different lengths, as indicated in Table 4. Helix-1 0 has the same length in all 

25 structures, but bends differently over the ligand binding pocket in PPARa, 
PPARy and PPAR8. Some of these structural differences are visible in Figure 
3. Many of these structural differences lie close to the ligand binding pocket, 
and could therefore be important in the receptor selectivity of different ligands. 
Helix-2' and the loop between helix-2' and helix-3 (the 2-3 joop) 

30 together serve as one wall for the "lower tail pocket" jn all three PPARs. 
Helix-2', and the 2'-3 loop, adopt different conformations in some of the 
different PPARy structures. In particular, helix-2* is shorter in the homodimer 
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structures of PPARy bound to rosiglitazone and GW0072, and in the absence 
of ligand, than it is in the RXRo/PPARy heterodimer stnjctures where 
'PPARy is bound to rosiglitazone or (GI262570. Some of th^se conformational 
differences might result from differences in crystal packiriig. : However/ the 
5 RXRctf PPARy heterodimer represents the closer approximation to the 
biologically active complex, and will be used here to represent the most 
relevant conformation of PPARy. By contrast, helix-2' and the 2'-3 loop adopt 
more-nearly equivalent backbone conformations jn the available X-ray 
structures of PPARa and PPAR8. Considering the conformation in the 

10 RXRcx/PPARy/GI262570 heterodimer structure, heiix-2 9 is longer in PPARa 
and PPARS^ and ' the C-terminal end of the helix adopts different 
conformations in the three PPARs. In' PPARa, the last residue of the helix, 
Leu254, makes the expectdd aipha-helical hydrogen bonds, but bulges away 
from the axis of the helik towards file ligand. This bulged confbrmatibh allows 

1 5 Leu£54 and Va!255 to cover the "bottom" of the lower tail pocket, effectively 
narrowing the mouth of the pocket In PPAR8, the C-terminal end of helix-2' 
is wound more loosely, such that the corresponding residue, Leu226, cannot 
cover the bottom of the : ligand binding pocket Instead, ihe 2-3 loop adopts a 
different conformation that places the side-chain of Trp228 in the bottom of 

20 the lower tail pocket This shortens the lower tail pocket slightly, and also 
constricts the opening to solvent. In PPARyl heiix-2' is bent such that its 
corresponding residue, Ile262, is shifted farther from the pocket Also, the 
next two residues, Lys263 and Phe264, adopt the helicial conformation, 
effectively extending helix-2 by two residues, such that Lys263 is far from the 

25 ligand and cannot cover the solvent channel. This leaves a very wide 
opening tb solvent at this position in PPARy, as depicted by the large, wide 
cavity region above the 2-3 loop in Figure 5. The opening is much narrower 
in PPARa and PPAR5, as depicted in Figures 4 and 6. These variations in 
the backbone conformation would be difficult or impossible to predict from the 

30 previously available X-ray structures with any homology modeling procedure. 
Nonetheless, very accurate models for helix-2' and the 2-3 loop in all three 
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PPARs would be essential for understanding the binding mode and receptor 
selectivity of ligands that occupy the lower tail pocket. 

The loop between helix-1 and helbc-2 (the "1-2 loop") serves as a wall 
at the far end of the upper tail pocket. Residues in this,1-2 loop come close to 
5 the tail of eicosapentaenoic acid (in its upper tail pocket binding mode), and 
close to the fluoro-.chloro-phenyl ring of GW2433 in PPAR5. Several 
backbone amide CO and NH groups are directed towards the ligand binding 
site, and could serve as hydrogen bonding partners .with an appropriately 
designed ligand. However, the backbone conformation in the 1-2 loop is 

10 slightly different in PPARa, PPARy and PPAR5. In particular, the backbone 
amide CO and NH groups have slightly different orientations : in PPARa, 
PPARy and PPAR8. Some of these differences may result from the presence 
of a proline at position 227 in PPARy, whereas PPARa and PPARS have 
asparagipes at this position. These subtle differences would be difficult or 

15 impossible to predict to high accuracy using the previously available X-ray 
structures with standard homology modeling procedures. Nonetheless, a 
very accurate model of this loop conformation would be essential for 
understanding the receptor selectivity of ligands that interact with this loop. 

The C-terminal end of helix-10 and the loop between helix-10 and the 

20 AF2 helix (the "10-AF2 loop") together serve as one wall for the headgroup 
binding site and the benzophenone pocket. An additional wall is provided by 
a glutamine residue in helix-3 (Gln277 in PPARa, Gln286 in PPARy, Gln250 
in PPAR5) that reaches across the benzophenone binding site to make 
hydrogen bonds with backbone CO and NH groups from the 10-AF2 loop. 

25 This glutamine side-chain, and the 10-AF2 loop itself, . adopt different 
conformations in the three PPARs. In PPARa and PPAR5, the glutamine 
side-chain adopts conformations that narrow the benzophenone pocket 
substantially. In addition, in PPARa and PPAR5, the 10-AF2 loop adopts a 
conformation that crowds a phenylalanine from helix-3 (Phe273 in PPARa, 

30 Phe282 in PPARy, Phe246 in PPARS) more in PPARa and PPAR5 than in 
PPARy. These two structural differences cause a slight narrowing of the 
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benzophenone pocket in ' PPARa* and a more substantial narrowing in 
PPARS. This cari'be seen by comparison of PPARa arid PPARS in Figures 4 
■ and 6 with* PPARy in - Figure 5. * the narrowing is sufficient to significantly 
• reduce the binding 'affinity of ligahds that have a behzophenone group'at this 
5 position. 'The narrowing ? iri PPARa and PPARS suggests^that smaller groups 
should be used at ffiTs* ] postti6n id obtain good binding to PPARa, 1 and 
substantially srnalfer groups should be used to obtain good binding to PPARS. 
Aside 'frdm 'thes cKariges that narrow th^ 

there are also structural changes that modulate the shape of the pocket and 

10 the position arid onen^tioh *of pbtential hydrogen bonding " For 

example, the C-termirial end of helix-10 bends inwards toward the ligand 
binding site fffSll^ree'PPARs. ' However^ the bend "is ^slightfy different in the 
different PPARs, placing cbhrespo'ndlhg r side-chains' in' slightly different 
posrtibhs; THese^ variations in^tfie Backbone cbhforrnationrand coTOspbnding 

1 5 changes in side^bhairi cbnfbf-matibh, would bb difficult or impossible to predict 
from the previously available" X-ray structures with any available homology 
modeling procedure: Nonetheless, very accurate models for helix-io, the'lO- 
AF2 loop and the side-chains In this" region would be essehtial for 
understanding the binding mode and "receptor selectivity of ligahds that 

20 occupy the benzophehone pocket. ^ 

Aside from these unexpected differences in the backbone structure of 
PPARa, the present X-ray structure also revealed differences involving side- 
chains. In some cases, these differences involve residue positions where the 
amino acid is : different in PPARa from that in either PPARy or PPARS or both. 

25 In other cases, PPARa may have the same amino acid at a particular residue 
position as PPARy or PPARS or both, but the amino acid side-chain adopts a 
different conformation in PPARa. y 

One of the most important side-chain differences isTyr314 in PPARa, 
a position i that corresponds to His323 in PPARy and His287 in PPARS. In 

30 PPARy and PPARS, this histidine residue makes a hydrogen bond with the 
acidic headgroup of the iigand for all strongly activating ligands for which 
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structures are available. The present PPARa structures shows that Tyr314 
also makes a hydrogen bond with the acidic headgroup of Compound 1. 
However, the tyrosine OH lies farther from the protein backbone than, the 
corresponding. hydrogen bonding atoms in histidine. Consequently,. changing 
5 histidine to tyrosine could potentially require a change in the position and/or 
orientation and/or conformation of the ligand, or changes in the conformation 
of the PPARa protein. The present PPARa crystal structure showed that the 
protein backbone is essentially unchanged in this region of PPARa, compared 
with PPARy and PPAR8. Instead, the acid headgroup of the ligand is shifted 

1 0 and rotated in such a way that it can still make hydrogen bonds with Ser280, 
Tyr314, His440 and Tyr464. This involves small changes in the 
conformations of Tyr314, ,His440 and Tyr464, relative to PPARy and PPAR5. 
Also, this involves a larger change in the conformation of Ser280, vyhich 
unexpectedly adopts a ^conformation, different,, fr^ that , : .of Sei289 in 

1 5 PPARy bound to GI262570. . The PPARa conformation places the Ser280 
side-chain oxygen near the position of the side^chajn oxygen pf Thr253 in 
PPARS : bound to : GW2433. t . The exact shift and rotation, and the exact 
conformations^ of the side-chains of Ser280, Tyr314, His440 and Tyr464, 
would b^ difficult or impossible to predict without this X-ray crystal structure of 

20 PPARa. However, this shift and rotation do significantly affect the position of 
the whole ligand within the ligand binding site. The position and orientation of 
the ligand carboxylate group revealed in this PPARa X-ray structure, and the 
interactions it makes with Ser280, Tyr314, His440 and Tyr464, can serve as a 
template for docking other compounds into PPARa using molecular modeling 

25 procedures. 

This PPARa X-ray structure also revealed numerous other side-chains 
where the conformation in PPARa was different from that in PPARy and/or 
PPAR5, and where the diffence could affect the shape of the ligand binding 
pocket, and the position, orientation and/or conformation of the ligand. These 
30 additional residues with differences include, but are not limited to, Gln277, 
Phe273, His274, I!e354, Leu321 f Met320, Met330 and Glu251. The side- 
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chain conformational differences involving ! Gln277, Phe273 and Ile354 affect 
the volume and detailed shape of the headgroup binding site and/or the 
benzophenone pocket. His274 may affect these pockets indirectly, through its 
effect on Gln277: and Phe273. -Leu321, Met320, Met330 and Glu251 affect 
the shape and volume of the upper and lower tail pockets. Numerous other 
side-chains also affect the size, shape ; and electrostatic character of the 
ligand binding site, and the position, orientation and conformation of the ligand 
. within the. Jigand binding site.- An understanding of-the PPARa, PPARy and 
PPAR8. selectivity of various ligands would depend on having an accurate 
structure of : - each- of the three PPARs, as well as an accurate position of the 
whole iligand .within .the; pocket. -. This PPARa X-ray structure provides in 
accurate protein structure, as well as a template" for modeling alternative 
ligands. 

to. VlLB Characterization of the PPARg Binding Pocket 
The Jigand binding domain of PPARa was co-crystallized with 
Compound 1 , which has the IUPAC name 2-rhethyl-i-[4-{I(4-methyl-2-[4- 
trifluoromethylphenyl] thiazoI-5-yl-carbonyl) amino] methyl} pherioxy] propionic 

acid. ■!■ . ■ 




Compound 1 

Compound 1 is an agonist of hPPARa and is useful for treatment of 
hPPARa mediated diseases or conditions including dyslipidemia, syndrome 
X, heart failure, hypercholesteremia, cardiovascular disease, type II diabetes 
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mellitus, type I diabetes, insulin resistance, hyperlipidemia, obesity, 
inflammation, anorexia bulimia and anorexia nervosa. 

Figures 1, 2 and 7 depict the conformation and orientation of the ligand 
Compound 1 in the binding site. Figure 1 depicts the oyerall orientation of the 
5 a helices and p strands of the ligand binding domain of PPARa as it binds 
Compound 1. Compound 1 is presented as a spacefilling model. 

A more specific graphical description of the residues of the PPARa 
LBD involved in ligand binding is presented in- Figure 2. Figure 2 is a 
schematic diagram depicting those residues of the PPARa LBD that interact 
10 with Compound J as it is bound in the binding pocket of the PPARa LBD. 
Note that Figure 2 is a schematic; diagram, and highlights residues that 
interact with the ligand and does not indicate intermolecular distances; : 

VII. C. Hydrogen Bonding in the Binding Pocket of the PPARa LBD 
15 The hydrogen bonding, scheme of the solvated binding : pocket is 

presented Lin Figure 7. In Figure 7, atoms are : shaded according to element, 
with carbon, fluorine, nitrogen, oxygen, sulfur and hydrogen in progressively 
lighter shades of gray., Sulfur is depicted with a slightly bigger ball, while 
hydrogen, fluorine and oxygen are depicted with slightly smaller balls. v The 
20 hydrogen atoms shown here were not visible in the electron density, and were 
instead modeled into reasonable conformations (using standard bond lenghts 
and angles) to obtain possible hydrogen bond interactions, shown here with 
strings of small white balls. Each interaction is annotated with its distance in 
angstroms. Residues of the PPARa LBD binding pocket involved in hydrogen 
25 bonding are labeled. It is observed that hydrogen bonding relationships exist 
between the side chains of PPARa LBD residues and the ligand, between the 
side chains of PPARa LBD residues and solvent molecules, and between the 
ligand and solvent molecules. The hydrogen bonding pairs are identified by 
dotted lines. 

30 Figure 7 highlights hydrogen bonding that occurs between solvent 

molecules (indicated as crosses in Figure 7) and the side chains of PPARa 
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. LBD residues in the binding pocket. Residues that hydrogen bond to solvent 
and are visible in Figure:? include Ser-280, Ser-283, Thr-279 and AJa-333: 

Additionally, Figure 7 highlights those residues of the PPARa LBD that 
hydrpgen bond directly tothe ligand. Residues of the PPARa LBD in'vvhich 
5 side chains hydrogen bond directly to the ligand are visible in Figure 7, and 
include His-440, Tyr-464, Tyr-314, Ser 280 and Thr-279. 

VII. D. Generation of EasilyrSolved PPAR Crystals — - - 

The, present;/ invention /discloses a substantially ' pure PPAR LBD 

1 0 polypeptide in crystalline-form, ,iln;,a preferred embodiment, exemplified in the 
Figures and ; Laboratory Examples, PPARa is crystallized with bound ligand. 
Crystals^are formed from PPAR LBD polypeptides that are usually expressed 
by a cell culture, such as £. coll. Bromo-, iodo- and substitutions can be 
included during; the preparation of; crystal; forms and ^n 'act as heavy atom 

15 substitutions, in PPAR ligands and crystals of PPARs. This method can : be 
advantageous for the phasing of the crystal, which is a;criicial, and ^ometlmeis 
limiting*: step in solving the three-dimensional structure of a crystallized entity. 
Thus,- the : need for generating the. heavy metal derivatives traditionally 
employed in crystallography can be eliminated. After the three-dimensional 

20 structure of a PPAR or PPAR LBD with or without a ligand bound is 
determined, the resultant three-dimensional structure can be used in 
computational methods to design, synthetic ligands for PPARa and other 
PPAR polypeptides. Further activity structure relationships can be 
determined through routine testing, using assays disclosed herein and known 

25 in the art. ■ .. . - 

VIII. Uses of PPARa Crystals and the Three-Dimensional Structure of the 
Ligand Binding Domain of PPARa - 

30 VI II. A. Design and Development of PPAR Modulators 

The knowledge of the structured the PPARa ligand binding dbmain 
(LBD), an aspect of the present invention, provides a tool for investigating the 
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mechanism of action of PPARa and other PPAR polypeptides in a subject. 
For example, various computer modelleing programs, as described herein, 
can predict the binding of various ligand molecules to the LBD of PPARa, 
PPARy or PPAR5. Upon discovering that such binding in fact takes place, 
5 knowledge of the protein structure then allows design and synthesis of small 
molecules that mimic the functional binding of the ligand to the LBD of 
PPARa, and to the LBDs of other PPAR polypeptides. This is the method of 
"rational" drug design, further described herein. 

Use of the isolated and purified PPARa crystalline structure of the 
10 present invention in rational drug design is thus provided in accordance with 
the present invention. Additional rational drug design techniques are 
described in U.S. Patent Nos. 5,834,228 and 5,872,011, incorporated herein 
in their entirety. 

.Thus, in addition to the compounds described herein, other sterically 
15 similar compounds can be formulated to interact with the key structural 
regions of a PPAR in general, or of PPARa in particular. The generation of a 
structural functional equivalent can be achieved by the techniques of 
modeling and chemical design known to those of skill in the art and described 
herein. It will be understood that all such sterically similar constructs fall 
20 within the scope of the present invention. 

V1I1.A.1. Rational Drug Design 

The three-dimensional structure of ligand-binding PPARa is 
unprecedented and will greatly aid in the development of new synthetic 

25 ligands for a PPAR polypeptide, such as PPAR agonists and antagonists, 
including those that bind exclusively to any one of the PPAR subtypes. In 
addition, the PPARs are well suited to modern methods, including three- 
dimensional structure elucidation and combinatorial chemistry, such as those 
disclosed in U.S. Patent No. 5,463,564, incorporated herein by reference. 

30 Structure determination using X-ray crystallography is possible because of the 
solubility properties of the PPARs. Computer programs that use 
crystallography data when practicing the present invention will enable the 
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rational design of ligands,tP these receptors: Programs such as RASMOL 
(Biomolecular Structures t Group, Glaxo. Wellcome: Research & Development 
Stevenage,. Hertfordshire, - «UK Version 2.6, August .^1995, Version ; 2.6.4, 
.December ,1.998, Copyright ©_Roger .SayIe[. 1992-1 999) can be used with the 
5 atomic., structural., coordinates, from, crystals, generated by practicing the 
invention; or used, to practice. the invention :by : generating three-dimensional 
modejs and/or determining the structures . involved in _ ligandu, binding. 
Computer programs, such as those sold under the registered:»trademark 
INSIGHT ll®uand such;as GRASP (Nicholls et al. ,r(1991);Profe//7S 11: 282) 

1 0 allow for further manipulations and the ability to introduce new structures. .. In 
addition, high. throughput binding and bioactivity assays: can be devised :iising 
purifjpd : recombinant, protein and modem reporter gene transcription assays 
rknown to those of skill in the; art in: order to refine? the ;activity of a designed 
;ligand. .|, ,*. i vvt^v!- t.^ir : ::.:;3E-:;«n.':..v.b Z'At ;ii r J-:*--v 0:':. o\ 

15 A method of identifying modulators : of e the activity^ of a/ PPAR 

polypeptide using rational drug design is thus provided in accordance with the 
present invention. The method comprises: designing, a . potential modulator for 
a PPAR polypeptide of ; the present invention that will form non-covalent 
.interactions with amino, acids ; in the Iigand r binding : ,pocket :: based upon the 

20 crystalline ... structure of the PPARa LBD, polypeptide; a synthesizing the 
modulator; • and determining whether the potential modulator modulates the 
activjty of f the PPAR polypeptide. In a preferred embodiment, the modulator is 
designed for a PPARa polypeptide. Preferably, the PPARa polypeptide 
comprises the nucleic acid sequence of SEQ ID.NO:1; and the PPARa LB D 

25 comprises the nucleic acid sequence SEQ ID NO:3. The determination of 
whether the modulator modulates the biological activity of a PPAR 
polypeptide- is made in accordance, with the screening methods disclosed 
herein, or , by other screening methods known to those of skill in, the art. 
Modulators, can be synthesized using techniques known to those of ordinary 

30 skill in the art.; v -r- . r :v c*. 

In an alternative embodiment, a method of designing a modulator of a 
PPAR polypeptide in accordance , with the,, present invention is disclosed 
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comprising: <a) selecting a candidate PPAR ligand; (b) determining which 
amino acid, or amino acids of an PPAR polypeptide interact with the ligand 
using a three-dimensional model of a crystallized PPARa LBD; (c) identifying 
in a biological assay for PPAR activity a degree to which the ligand modulates 
5 the activity of the PPAR polypeptide; (d) selecting a chemical modification of 
the ligand wherein the interaction between the amino acids of- the PPAR 
polypeptide and the ligand is predicted, to be modulated by the chemical 
modification; (e) performing the chemical modification on the ligand to form a 
modified ligand; (f) contacting the modified ligand with the PPAR polypeptide; 

10 (g) identifying in a biological assay for PPAR activity a degree to which the 

modified ligand modulates the biological activity of the PPAR polypeptide ; } 
and (h) .comparing the. biological activity of the PPAR polypeptide 5 in the 
: presence^ of modified r> ligand with the- biological ^activity- of the PPAR 
polypeptide in the presence of the unmodified ligand, whereby a modulator of 

15 Tan PPAR polypeptide is designed.' 

' ^ VllkA.2: f Methods for Using the PPARa LBD Structural 
* ^ Coordinates For Molecular Design : 5 
' for the first time, the present invention permits the use of molecular 
20 design techniques to design, select and synthesize chemical entities and 
compounds, including modulatory compounds, capable of binding to the 
ligand binding pocket or an accessory binding site of PPARa and the PPARa 
LBD, in whole or in part. Correspondingly, the present invention also provides ) 
for the application of similar techniques in the design of modulators of any 
25 PPAR polypeptide. 

In accordance with a preferred embodiment of the present invention, 
the structure coordinates of a crystalline PPARa LBD can be used to design 
compounds that bind to a PPAR LBD (more preferably a PPARa LBD) and 
alter the properties of a PPAR LBD (for example, the dimerizatioh or ligand 
30 binding ability) in different ways. One aspect of the present invention provides 
for the design of compounds that can compete with natural or engineered 
ligands of a PPAR polypeptide by binding to all, or a portion of, the binding 
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sites on a PPAR LBD. The present invention also provides for the design of 
compounds that can bind to alii or a portion of, an accessory binding site oh a 
PPAR ! that is already binding * a ligand. Similarly, noncompetitive 
agonists/ligands that bind to and modulate PPAR LBD activity, whether or not 
5 hit is bound to another chemical entity, can be designed using the PPAR LBD 
structure coordinates of this invention. 

A second design approach- is to probe a -PPAR or PPAR LBD 
(preferably a PPARa or PPARa LBD) crystal with molecules comprising a 
variety . of different chemical entities :to determine optima! sites for interaction 

10 between candidate PPAR or PPAR LBD modulators and the polypeptide. For 
example; high resolution X-ray diffraction " data 2 ^ collected "from' crystals 
/saturated with solvent allows' the determination of the site where each type of 
solvent molecule adheres! SmairmolMuleVthat bind tightly' t6 those sites can 
then; be" tiesigne# : and synthesized arid tested for their PPARa ? modulator 

15 activity. Representative -designs 1 are also disciosfed ih - published PCT 
application WO- 99/26966. : ^ v ->v; , ; : - 

Once a cbmputaitiorially-designed ligand 1 is synthesized usirijg the 
methods of the preserit ihventibn or other methods known to those of skill in 
the art; "&£says ,; can ? be used to establish rt$ efficacy of the ligand as a 

20 modulator of PPAR (preferably PPARa) 'activity. After such assays, the 
ligands can be further refined by generating intact PPAR, or PPAR LBD, 
crystals with a ligand bound to the LBD. The structure of the ligand can then 
be further refined using the bhemical modification methods described herein 
and known to those of skill in the art, in order to improve the modulation 

25 activity or the binding affinity of the ligand. This process can lead to second 
generation ligands with improved properties. 

Ligands also can be selected that modulate PPAR responsive gene 
transcription by the method of altering the interaction of co-activators and co- 
repressors with their cognate PPAR. For example, agonistic ligands can be 

30 selected that block or dissociate a co-repressor from interacting with the 
PPAR, arid/or that promote binding or association of a co-activator. 
Antagonistic ligands can be selected that block cbractivator interaction and/or 
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promote co-repressor interaction with a target receptor. Selection can be 
done via binding assays that screen for designed ligands having the desired 
modulatory properties. Preferably, interactions of a PPARa polypeptide are 
. targeted. Suitable assays for screening that can be . employed, mutatis 
5 mutandis in the present invention, are described in as described in Nichols et 
aL, (1998)y4na/. Biochem. 257: 112-19 and Xu'etaL , (1999) Mot. Ceil 3: 397- 
403, which are incorporated herein in their entirety by reference. 

VIH.A.3. Methods of Designing PPARa LBD r Modulator 

10 . Compounds \ . 

The design of candidate substances, also referred to as "compounds" 
or rcandidate compounds", that bind to or inhibit PPAR LBD-mediated activity 
according to the present invention generally involves consideration of two 
factors. First, the compound, must be capable of physically and structurally 

15 associating with a PPAR LBD. Non-covalent molecular interactions important 
in the association of a PPAR LBD with its substrate; include hydrogen 
bonding, van der Waals interactions and hydrophobic interactions. 

Second, the compound must be able to assume a conformation that 
allows it to associate with a PPAR LBD. Although certain portions of the 

20 compound will not directly, participate jn this association with a PPAR LBD, 
those portions can still influence the overall conformation of the molecule. 
This, in turn, can have a significant impact on potency. Such conformational 
requirements include the overall three-dimensional structure and orientation of 
the chemical entity or compound in relation to all or a portion of the binding 

25 site, e.g., the ligand binding pocket or an accessory binding site of a PPAR 
LBD, or the spacing between functional groups of a compound comprising 
several chemical entities that directly interact with a PPAR LBD. 

The potential modulatory or binding effect of a chemical compound on 
a PPAR LBD can be analyzed prior to its actual synthesis and testing by the 

30 use of computer modeling techniques that employ the coordinates of a 
crystalline PPARa LBD polypeptide of the present invention. If the theoretical 
structure of the given^ compound suggests Insufficient interaction and 



(C) 2003 Copyright Derwenl Information Ltd. 



WO 02/064632 



PCT/US02/03278 



^52- 

asspciation between it and a .PPAR LBD, synthesis and testing of the 
compound u\s. obviated . ^Howevier, if computer modeling indicates 1 a strong 
interaction, the molecule can then be synthesized and tested for its ability to 
bind. and modulate the > activity, of a PPAR LBD.vUn this manner, synthesis of 
5 unproductive or inoperative compounds can be avoided. 1 s 

A modulatory or other binding compound of a PPAR LBD polypeptide 
(preferably a PPARa LBD) can be computationally evaluated and designed 
via a series of steps in which chemkial entities or fragments are screened and 
: selected for:; tJieir abiJity : to associate with the. individual binding sites or other 
1 0 areas of a crystalline PPARccLBD polypeptide of the present invention. 

One of several methods can be used to screen chemical entities or 
-fragments 7 for their ability to associate- with a' PPAR LBD and, more 
pa'fficularty; !: with We iridividual 

binding pocket or %h accessory binding^ sfte>~Tfiis process can begih by visual 
1 5 inspection 3 of, for f iexiample, ihd ligand 1 binditfg pocket rbn f s ^ <xiniputer screen 
'based oh thePPARcTLBD atomic raoirciihates; in Table d: 1 Selected fragments 
or chemical entities- can then be pbsitioried in a variety of orient&tidns, or 
• docked," within an"- individual binding site of a PPARct : LBD as defined herein 
above. Docking can be accomplished using software ^ prbgrarhs siich as those 
20 available under the tradenames QUANTA™ (Moleojlar Simulations Inc., San 
Diego, California) and SYBYL™ (Tripos, Incl; St: Louis; Missouri), followed by 
energy minimization and molecular, dynamics' with standard molecular 
mechanics forcefields, such as CHARM (Brooks et al. J (^983) J. Comp. 
Chem., 8: 132) and AMBER 5 (Case et ah , (1997), AMBER 5, University of 
25 California, Sah Francisco; Pearlman et al. , (1995) Comput. Phys. Commun. 
91: 1-41). 

Specialized computer programs can also assist in the process of 
selecting fragments or chemical entities. These include: 

1. GRID™ program, version 17 (Goddford , (1985) J. Med Chem. 28: 
30 849-57), which is available from Molecular Discovery Ltd., Oxford, UK; 

2. MCSS™ program (Miranker & Karplus; (1991) Proteihs 11: 29-34), 
which is available from Molecular Simulations, Inc., San Diego, California; 
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3.. AUTODOCK™ 3.0 program (Goodsell & Olsen , (1990) Proteins 8: 
195-202), which, is available from the Scripps Research Institute, La Jolla, 
California; . ~ 

4. DOCK™. 4.0 program (Kuntz et aL , (1992) J. MoL Biol. 161: 269- 
5 88), which is available from the University of California, San Francisco, 

California; . - f , . , • 

5. FLEX-X™ program (See, Rarey et aL , (1 996) J. Compute Aid. MoL 
Des. 10:41-54), which is available from Tripos, Inc., St Louis, Missouri; 

6. MVP program (Lambert , : (1997) in Practical Application of 
10 Computer-Aided Drug Design . (Charifson, ed.) Marcel-Dekker, New York, pp. 

243-303); and.. , . . , ) 

7. LUDI™ program (Bohm, (1992) J. Comput Aid., MoL Des., 6: 61- 
78), which is available from Molecular Simulations, Inc., San Diego, California. 

Once suitable chemical entities or fragments have beeirselected, they 
15 can be assembled into a single compound or modulator. Assembly can 
proceed by visual inspection of the relationship of the fragments, to each other 
on the three-dimensional image displayed on a computer screen in relation to 
the structure coordinates of a^PPARot LBD. Manual model- building using 
software such as QUANTA™ or SYBYL™ typically follows. 
20 Useful programs to aid one of ordinary skill in the art in connecting the 

individual chemical entities or fragments include: 

1. CAVEAT™ program (Bartlett et al. , (1989) Special pub;, Royal 
Chem. Soc. 78: 182-96), which is available from the University of California, ) 
Berkeley, California; 

25 2. 3D Database systems, such as MACCS-3D™ system program, 

which is available from MDL Information Systems, San Leandro, California. 
This area is reviewed in Martin , (1992) J. Med. Chem. 35: 2145-54; and 

3. HOOK™ program (Eisen et al. , (1994). Proteins 19: 199-221), 
which is available from Molecular Simulations, Inc., San Diego, California. 

30 Instead of proceeding to build a PPAR LBD modulator (preferably a 

PPARa LBD modulator) in a step-wise fashion one fragment or chemical 
entity at a time as described above, modulatory or other binding compounds 
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can: be designed as a whole or de novo using the structural coordinates of a 
. crystalline PPARa LBD polypeptide of the -present invention and either' an 
. empty., binding site ' or ; optionally including' 'some portiori(s) of a" known 
mpdulator(s). .... Applicable '/methods can> ; employ the' following software 
5 /programs:- . ^b- . - - ' u:-r -s - ~ r / :v - <v ■*-•■ * * 
1. LUDI™ program (Bohm , (1992) J: Comput. Aid. MoL Dei, 6: :? 61- 
.78), which is available from Molecular Simulations, Inc:, San Diego, California; 

; . 2. ; LEGEND™ program (Nishibata &^ltai r (1991) Tetrahedron 47: 
8985); and ,.^.0 

10 , ; ; 3. LEAPFROG™, which is available from Tripos Associates, St. Louis, 
Missouri. ■ivv-.v :v r v:-^-- '^k.i: 

Other molecular modeling techniques ban also be iemplby^d :in 
accordance with this invention: See^ B-g .^Coheri ef 5 aL ; (1 990) 3; Med. Chem. 
33: 883-94. See also , Navia & Murcko , (1992) Cu/r. Opln. Stn& BidlL 2: 202- 

15 10; U.S. Patent No. 6,008,033, herein incorporated by referenced 

Once a compound has been -designed of sel^e'd by ! tHe ? %b6ve 
methods, the efficiency with which that compound can bind to a PPARa LBD 
can be tested and optimized by computational evaluations By way of 
particular example," a compound that has been designed or selected to 

20 function as a PPARa LBD modulator should also preferably traverse a volume 
not overlapping that occupied by the binding site when it is bound to its native 
ligand. Additionally, an effective PPAR LBD modulator should preferably 
demonstrate' a relatively small difference in energy bbtWeen its bound and 
free- states (i.4., a small deformation energy of binding). Thus, the most 

25 efficient PPAR LBD modulators should preferably be designed with a 
deformation energy of binding of not greater than about 10 kcal/mole, and 
preferably, not greater than* 7 kcal/mole. It is possible for PPAR LBD 
modulators to interact with the polypeptide in more than one conformation that 
is similar in overall binding energy. In those cases, the deformation Energy of 

30 binding is taken to be the difference between the energy of the free compound 
smd the average energy of the conformations observed when the modulator 
binds to the polypeptide. 
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. A compound designed or selected as binding to a PPAR polypeptide 
(preferably a PPARa LBD polypeptide) can be further computationally 
optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target polypeptide. Such non-complementary 
5 (e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole 
and charge-dipole interactions. Specifically, the. sum of all electrostatic 
interactions between the modulator and the polypeptide when the modulator 
is bound to a PPAR LBD preferably make a neutral or favorable contribution 
to the enthalpy of binding. 

10 . Specific computer software is available in the art to evaluate compound 

deformation energy and electrostatic interaction. Examples of programs 
designed for such uses include:. . , 

rl 1-« Gaussian 98™, which is available from Gaussian,' Inc., Pittsburgh, 
Pennsylvania;, ;- : „ . \m v: . . . ;> r 

15 2. AMBER™ program, version 6.0, which, is available from the 

University of California at San Francisco; 

3. QUANTA™ program, which is available from Molecular Simulations, 
Inc., San Diego, California; t ...... 

4. CHARMm® program, which is available from Molecular Simulations, 
20 Inc., San Diego, California; and v : 

4. Insight II® program, which is available from Molecular Simulations, 
Inc., San Diego, California. 

These programs can be implemented using a suitable computer J 
system. Other hardware systems and software packages will be apparent to 
25 those skilled in the art after review of the disclosure of the present invention 
presented herein. 

Once a PPAR LBD modulating compound has been optimally selected 
or designed, as described above, substitutions can then be made in some of 
its atoms or side groups in order to improve or modify its binding properties. 
30 Generally, initial substitutions are conservative, i.e., the replacement group 
will have approximately the same size, shape, hydrophobicity and charge as 
the original group. It should, of course, be understood that components 
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knqwn in the art to alter conformation should be avoided. Such substituted 
chemical compounds can then be analyzed for efficiency of fit to a PPAR LBD 
binding site using the same computer-based approaches described in detail 
above..- .. ' - : ^ r - l - . 

5 VIII .B. Distinguishing Between PPAR Subtypes « ;.. ; 

. .The . present invention discloses the ability to generate new synthetic 
ligands to- distinguish between- PPAR ^subtypes.: As- described herein, 
computer-designed ligands can be generated that distinguish between PPAR 
subtypes, thereby allowing tthe generation of either tissue specific or function 

10 ; specific ligands. The atomic structural coordinates disclosed in the present 
, invention reveal structural details unique to PPARa. These structural details 
can be exploited when a novel ligand is designed using the 'methods of the 
present invention orrother ligand design 1 methods known In the "art. The 
structural features that differentiate, for example; a PPARa from a PPARy can 

15 be targeted in ligand design^ Thus; for example, a ligand can be designed 
that will recognize PPARa, while riot interacting with other PPARs^oreven 
with moieties having similar structural features. Prior to the- disclosure of the 
present invention, the ability to target a PPAR subtype was unattainable: " 
; The present invention also pertains to a method for designing an 

20 agonist or modulator with desired levels of activity ori : the three subtypes, 
PPARa, PPARy- and PPAR5. In a preferred embodiment, the method 
comprises obtaining atomic coordinates for structures of the PPARa, PPARy 
and/or PPAR5 ligand binding domains; The structures can comprise PPARa, 
PPARy and PPAR5 each bound to various different ligarids, and also can 

25 comprise structures where no ligand is present. The structures can also 
comprise models where a compound has been docked into a particular PPAR 
using a molecular docking procedure, such as the MVP program disclosed 
herein. Optionally, the structures comprise rotated and translated so as to 
superimpose corresponding Coc or backbone atoms; this facilitates the 

30 comparison of structures. 

The PPARa, PPARy and PPAR5 structures can also be compared 
using a computer graphics system to identify regions of the ligand binding site 



(0 2003 Copyright Derwent Information Ltd. 



WO 02/064632 



PCT/US02/03278 



-57- 

that have similar shape and electrostatic character, and to identify regions of 
the ligand binding site that are narrowed or constricted in one or two of the 
PPARs compared with the others). Since these three PPARs are subject to 
conformational changes, attention is paid to the range of motion observed for 
5 each protein atom over the whole collection of /structures. The ligand 
structures, including both those determined by X-ray crystallography and 
those modeled using molecular docking procedures, can be examined using a 
computer graphics system to identify ligands where a chemical modification 
could increase or decrease binding to a particular PPAR, or decrease activity 
10 against a particular PPAR. Additionally or alternatively, the chemical 
modification can introduce a group into a volume that is normally occupied by 
an atom of thatpPAR. 

~; fOptionallyr to, selectively decrease activity against a particular PPAR, 
the chemical ,. modification, can be made so as to occupy volume that is 

15 normally occupied by atoms of that particular PPAR, but not by atoms of the 
other PPARs. To increase, activity against a particular PPAR, a chemical 
modification can be made that improves interactions with that particular 
PPAR. To selectively increase activity against a particular PPAR, a chemical 
.modification can be made that improves the interactions with that particular 

20 PPAR, but does not improve the interactions with the other PPARs. Other 
design principles can also be used to increase or decrease activity on a 
particular PPAR. 

Thus, various possible compounds and chemical modfications can be 
considered and compared graphically, and with molecular modeling tools, for 
25 synthetic feasibility and likelihood of achieving the desired profile of activation 
of PPARa, PPARy and PPAR5. Compounds that appear synthetically 
feasible and that have a good likelihood of achieving the desired profile are 
synthesized. The compounds can then be tested for binding and/qr activation 
of PPARa, PPARy and PPARB, and tested for their overall biological effect 

30 
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- VIII.C. Method of Screening for Chemical and Biological Modulators of 
r .; . t . r ; ^ the Biological ActivitV'Of PPARg • ^ r ' : r > ■ tv - : ' Vv:: ^ if "" 1 "' ; ' ' 
<- * A candidate substance identified 1 according to a~screening assay of the 
■present invention has-an ability to modulate the biological activity of a PPAR 
5 on a ; PPAR LBD"polypeptider>^lri f S preferred embodiment", such a candidate 
: compound -can;^h*ave r utility in the: treatment of disorders 11 and conditions 
associated with the biological activity of a PPARa" 'or a PPARa 4 LBD 
polypeptide; including lipid homeostasis. - ^ » ' - - ! * f 
In a cell-free system; "the'methbd 
10 rcontrol system comprising 'a 6 crystalline PPARa pdlypepticle ; arid a ligiand 
^) which is. capable ;of tbinding^to: the* polypeptide; establishing a tesFsystem 

comprising ,ay crystalline', PPARan polypeptide; the > ligand, - and a candidate 
^compoy nd; : and, determining whether, the; candidate connipou rid modulates the 
: activity, of the polypeptide by; comparison, of the test and icontrol systems*: : A 
15 representative ligand; comprises Compound .1; pn other small moleculer and in 
this embodiment, the : biolpgical activity<or : property screened includes binding 

. affinity.. .... . ^tji.-;, n-\. .:*jr.-*.- ^. m ,./:■'.->. - > • 

., . Jn another. : jembptflrn^nt;^ 0 ^ ;.the ji Jriyfsntlpn, ■ a ... crystalline fomi of a 
PPARa polypeptide a pr a : ;catalytic or immunogenic fragment or oligopeptide 
20 thereof,pan be used for screening libraries of. compounds in any of a variety 
of drug screening techniques. The fragment employed, in such a screening 
can be affixed to a solid ^support The formation of binding complexes, 
J between a crystalline PPARa. polypeptide and the agent* being tested, will be 

detected. In a preferred embodiment, the crystalline PPARa polypeptide has 
25 an amino acid sequence of SEQ ID NO:2. When a PPARa LBD polypeptide 
is employed, a preferred embodiment will include a crystalline PPARa 
polypeptide having the amino acid sequence of SEQ ID NO:4. 

Another technique for drug screening which can be used provides for 
high throughput screening of compounds having suitable binding affinity to the 
30 protein of interest as described in published PCT application WO 84/03564, 
herein incorporated -by reference. In #\\s method, as applied to a crystalline 
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polypeptide of the present invention, large numbers of different small test 
compounds are synthesized on a solid substrate, such as plastic pins or some 
other surface.,. The test compounds are reacted with the crystalline 
polypeptide, or fragments thereof . Bound polypeptide is then detected by 
methods well known to those of skill in the art. ;The crystalline polypeptide 
can also be placed directly onto plates for use in the aforementioned drug 
screening techniques. ~ 

In yet another embodiment, a method of screening for a modulator of a 
PPARa or. a PPARa LBD polypeptide comprises: providing a library of test 
samples; contacting a crystalline form of PPARa or a crystalline form of an 
PPARa LBD polypeptide with each-test sample; detecting an interaction 
between a test sample and a crystalline form of a PPARa or a crystalline form 
of a PPARa LBD polypeptide; identifying; a: test sample that interacts with a 
crystalline^ form >of a^ PPARd or a crystalline form of a PPARa LBD 
polypeptide; and isolating a test sample that interacts with a crystalline form of 
a PPARa or a crystalline form of a PPARa LBD polypeptide. 

In each of the foregoing embodiments, an interaction can be detected 
spectrophotometrically, radiolbgically or immunologically. An interaction 
between a crystalline form of a PPARa or a crystalline form of a PPARa LBD 
polypeptide and a test sample can also be quantified using methodology 
known to those of skill in the art. 

In accordance with the present invention there is also provided a rapid 
and high throughput screening method that relies on the methods described 
above. This screening method comprises separately contacting each of a 
plurality of substantially Identical samples with crystalline form of a PPARa or 
a crystalline form of a PPARa LBD and detecting a resulting binding complex. 
In such a screening method the plurality of samples preferably comprises 
more than about 10 4 samples, or more preferably comprises more than about 
5 x10 4 samples. ~ 

VII LP. Method of Identifying Compounds Which Inhibit Llgand Binding 
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In one aspect of the present Invention, an assay method for identifying 
, a compounds ; that inhibits binding • of^ a ligand to a PPAR ^polypeptide is 
disclosed.; A ligand of PPARa, such as Compound 1, can be used in the 
assay method as the ligand against which the inhibition by a test compound is 
5 gauged. The method comprises (a) incubating a PPAR polypeptide with a 
ligand ^in ithe^ presence of a test inhibitor compound; (b) determining an 
amount of ligand that is bound to the PPAR polypeptide, wherein decreased 
binding , of ligand, to the PPAR polypeptide in the presence of the test inhibitor 
compound relative to binding in the absence of the test inhibitor compbund is 
1 0 indicative of inhibition;, and (c) identifying the test compound as an inhibitor bf 
ligand binding if decreased ligand binding is observed. Preferably, the ligand 
is Compound 1. / z :>. >. 

In another aspect of the present invention, the disclosed assay method 
can be/used iaithe structural refinement of candidate PPAR inhibitors/ For 
15 example, multiple rounds of optimization can be followed' by gradual structural 
changes -in a ; strategy : of ^inhibitor design. A. strategy such as this is made 
: possible by :the disclosure of the coordinates of the PPARa LBD. 



IX. ^ Design. Preparation and Structural Analysis of Additional PPARa and 
20 PPARd LBD Mutants and S^cturat Eduivafehte r 

The present invention provides for the generation of PPAR and PPAR 
mutants (preferably PPARa and PPARa LBD mutants), and the ability to 
solve the crystal structures of those that crystallize. More particularly, through 
the provision of the three-dimensional structure of a PPARa LBD, desirable 
25 sites for mutation can be identified. 

the structure coordinates of a PPARa LBD provided in accordance 
with the present invention also facilitate the identification of related proteins or 
enzymes analogous to PPARa in function, structure or both, (for example, a 
PPARy) which can lead to novel therapeutic modes for treating or preventing 
30 a range of disease states. 
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IX.A. Sterically Similar Compounds 
; C A iurther aspect of, the present invention is that sterically similar 
compounds can be formulated to mimic the key portions of a PPAR LBD 
: structure. Such compounds are functional equivalents. The generation of a 
structural functional equivalent can be . achieved by the techniques of 
modeling and chemical design known to those of skill in the art and described 
herein. Modeling and chemical design of PPAR and PPAR LBD structural 
equivalents can be based on the structure coordinates of a crystalline PPARa 
LBD polypeptide of the present invention. It will be understood that all such 
sterically similar constructs fall.within the scope of the present invention. 

IX.B. PPARa Polypeptides 
:c The generation of chimeric PPAR polypeptides is also an aspect of the 
. present invention. : Such a chimeric polypeptide can comprise a PPAR LBD 
polypeptide or a portion of a PPAR LBD, (e.g. a PPARa LBD) that is fused to 
a candidate polypeptide or a suitable region' of the candidate polypeptide, for 
example PPARy. .Throughout the present disclosure it is f intended that the 
term "mutant" encompass not only mutants of a PPAR LBD polypeptide but 
chimeric proteins generated using a PPAR LBD as well. It is thus intended 
that the following discussion of mutant PPAR LBDs apply mutatis mutandis to 
chimeric PPAR and PPAR LBD polypeptides and to structural equivalents 
thereof. . 

In accordance with the present invention, a mutation can be directed to 
a particular site or combination of sites of a wild-type PPAR LBD. For 
example, an accessory binding site or the binding pocket can be chosen for 
mutagenesis. Similarly, a residue having a location on, at or near the surface 
of the polypeptide can be replaced, resulting in an altered surface charge of 
one or more charge units, as compared to the wild-type PPAR and PPAR 
LBD. Alternatively, an amino acid residue in a PPAR or a PPAR LBD can be 
chosen for replacement based on its hydrophilic or hydrophobic 
characteristics. 
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Such mutants can be characterized by any one of several 4 different 
properties as compared 1 with the wild-type PPAr LBD. For example, such 
mutants! can have "an altered 1 surface charge of one or more charge units, or 
:can have an increase in Overall stabi[ityr r Other mutahts can have altered 
5 substrate^ specificity in corhparisoh with/ or a higher specific activity than, a 
wild-type PPAR or PPAR LBD:- " ' " c,ov 

-i PPAR arid PPAR LBD- mutants^ of this -present ! ^inveM6n "can be 
generated in a number of ways; ; For example/ the wild-tyiie sequence of a 
PPAR -or a' PPAR LBD can be" mutated at those sit^s identified'us^ 
1 0 invention -as -desirable for mutation; by ^ineans 'of 'bligbnti^ 
^) hhutagehesiSHor bther conventional' metfiods/such as deletion! ,? 'Alternatively, 

mutants^ 6f a PPAR- or a PPAR : tBD cart generated by ^ 
replacement of ^a pafti^ 'amino 
acid:-^ln^addito^ mutant <^n r ^ 

15 replabemeht of an amino ; ^cid residuertei e cysteine or 

methionine 'residue, with : selenbcysteine'p> selehomethibnink : This can "be 
achieved by growing a host "organism capable of expressing either the wild- 
type or miitarit polypeptide bh 1 a - growth medium depleted of either natural 
cysteine or methionine (or 1 both) but enriched in selenocy^teihe or 
20 selenomethionine (or both). ^ ^ 5 r 

Mutations can be introduced into a DNA sequence coding for a PPAR 
or a PPAR LBD using synthetic oligonucleotides. These oligonucleotides 
J) contain nucleotide sequences flanking the desired imitation sites/ Mutations 

can be generated in the full-length DNA sequence of a PPAR or a PPAR LBD 
25 or in any sequence coding for polypeptide fragments of a PPAR or a PPAR 
LBD. 

According to the present invention, a mutated PPAR or PPAR LBD 
DNA sequence produced by the methods described above, or any alternative 
methods known in the art, can be expressed using an expression vector. An 
30 -expression vector, as is well known to those of skill in the art, typically 
includes elements that permit autonomous replication in a host cell 
independent of the host genome, and one or more phenotypic markers for 
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selection purposes. Either prior to or after insertion of the DNA sequences 
surrounding the desired PPAR or PPAR LBD mutant coding sequence, an 
expression vector also will include control sequences encoding a promoter, 
f operator, r ribosome binding site, translation initiation. signal, and, optionally, a 
5 repressor gene or various activator genes and a signal for termination, in 
some embodiments, where secretion of the produced mutant is desired, 
nucleotides encoding a "signal sequence" can be inserted prior to a PPAR or 
a PPAR LBD mutant coding sequence. For expression under the direction of 
the control sequences, a desired DNA sequence must be operatively linked to 

10 the control, sequences; that is, r the sequence must have an appropriate start 
signal in front of the L DNA. .sequence encoding the PPAR,or :i PPAR LBD 
mutant, ^and the correct reading frame to permit expression of that sequence 
under the t a>ntrol of the control sequences and production of the desired 
r pjoduct encoded by that PPAR or PPAR LBD sequence must be maintained. 

15 Any of a wide -variety of well-known available expression vectors can . 

be useful to express a mutated PPAR or PPAR LBD coding sequences of this 
invention. These include for example, vectors, consisting of segments of 
chromosomal, non-chromosomal and synthetic DNA sequences* such as 
various known derivatives of ,SV40, known bacterial plasmids, e.g., plasmids 

20 from E. coti including col E1, pCR1, pBR322, pMB9 and their derivatives, 
wider host range plasmids, e.g., RP4, phage DNAs, e.g., the numerous 
derivatives of phage X s e.g., NM 989, and other DNA phages, e.g., M13 and 
filamentous single stranded DNA phages, yeast plasmids and vectors derived J) 
from combinations of plasmids and phage DNAs, such as plasmids which 

25 have been modified to employ phage DNA or other expression control 
sequences. In the preferred embodiments of this invention, vectors amenable 
to expression in a pRSETA-based expression system are employed. The 
pRSETA expression system is available from Invitrogen, Inc., Carlsbad, 
California. 

30 In addition, any of a wide variety of expression control sequences — 

sequences that control the expression of a DNA sequence when operatively 
linked to it — can be used in these vectors to express the mutated DNA 
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sequences according to this invention. Such useful expression control 
sequences, include, for example, the early and late promoters of SV40 for 
animal cells;* the lac system; tS the trp system -the -TAG or TRC system, the 
major operator and promoter regions of phage X, the control regions of fd' coat 
5 protein; all for E. coli, the promoter for 3-phosphoglycerate kinase or other 
glycolytic ; enzymes; the ; promoters of acid phdyphatase, e.g M Pho5, the 
promoters of the yeast a-mating ^factors fo^r yeast, and other sequences 
known to 'control the expression of genes of prokairyotic or eukaryotic cells or 
their viruses, and various combinations thereof. ' - 
10 : , A wide variety of hosts ; are also "useful for prbciucing mutated 

PPARa and PPARa LBD polypeptide^' according to this invention J ^ TTiese 
hosts include, for example, bacteria, such as E. coli, Bacillus arid 
Str^iofn^ces 9 fungi, such Ss yeasts, arid i animal ceils, such as CHO and 
cbs-t'ceiis, plant cells, irisect dells, sklch as Sf9 cells, arid transgenic host 
15 -cells.'' - ■ - ••' ; •"• 

It should be understood that riot all expression vectors and expression 
systems function in the same way to express rnuiated DNA sequences of this 
invention, arid i fo- produce "modified PPAR and P PAR LBD polypeptides or 
PPAR 6r PPAR LBD niutarits. Neither do air hosts furidipri equally well wrt 
20 the isame expression systeml One of skill in trie art can, however, make a 
selection among these vectors, expression control sequences and hosts 
without undue experimentation and without departing from the scope of this 
J invention. For example, an important consideration in selecting a vector will 

be the ability of the vector to replicate in a given host. The copy number of 
25 the vector, the ability to control that copy nuriiber, arid the expression of any 
other proteins encoded by the vector, such as antibiotic markers, should also 
be considered. 

In selecting ari expression control sequence, a variety of factors should 
also be considered! These include, for example, the relative strength of the 
30 system, its controllability and its compatibility with the DNA sequence 
encoding a rinodified PPAR or PPAR LBD polypeptide of this invention, with 
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particular regard to the formation of potential secondary and tertiary 
structures. 

Hosts should be selected by consideration of their compatibility with the 
chosen vector, the toxicity of a modified PPAR or PPAR LBD to them, their 
ability to express mature products, their ability to fold proteins correctly, their 
fermentation requirements, the ease of purification of a modified PPAR or 
PPAR LBD and safety. Within these parameters, one of skill in the art can 
select various vector/expression control system/host combinations that will 
produce useful amounts of a mutant PPAR or PPAR LBD, A mutant PPAR or 
PPAR LBD produced in these systems can : be purified by a variety of 
conventional steps and strategies, including those used to purify the wild-type 
PPAR or PPAR LBD. 

Once a PPAR LBD mMtation(s) has been generated in the . desired 
location, such as an active site or dimerization site, the mutants can be tested 
for any one of several properties of interest. For example, mutants can be 
screened for an altered charge at physiological pH. . This is determined by 
measuring the mutant PPAR or PPAR LBD isoelectric .point (pi) and 
comparing the observed value with that of the wild-type parent. Isoelectric 
point can be measured by gel-electrophoresis according to the method of 
Wellner (Wellner , (1971) Anal. Chem. 43: 597). A mutant PPAR or PPAR 
LBD polypeptide containing a replacement amino acid located at the surface 
of the enzyme, as provided by the structural information of this invention, can 
lead to an altered surface charge and an altered pi. 

IX.C. Generation of an Engineered PPARa LBD or PPARa LBD 
Mutant 

In another aspect of the present invention, a unique PPAR or PPAR 
LBD polypeptide can be generated. Such a mutant can facilitate purification 
. and the study of the ligand-binding abilities of a PPAR polypeptide. 

As used in the following discussion, the terms "engineered PPAR", 
"engineered PPAR LDB", "PPAR mutant", and "PPAR LBD mutant" refers to 
polypeptides having amino acid sequences which contain at least one 
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mutation in the wild-type sequence. The terms also refer to PPAR and PPAR 
LBD polypeptides which are capableof exerting a biological effect in that they 
comprise all or a part of the amino acid sequence of an engineered PPAR or 
-PPARLBD jmutant polypeptide of the present invention, or- cross-react with 
5 antibodies raised against an engineered PPAR or PPAR LBD - mutant 
polypeptide, or retain all or some or/an enhanced degree of the biological 
activity of the engineered PPAR or PPAR LBD mutant amino acid sequence 
or protein. Such biological activity can include the binding of small molecules 
in general,- and the binding of Compound ^ in particular. ti:- 

1 0 The .terms "engineered PPAR LBD" and "PPAR LBD mutant" ( also 

) includes analogs of an engineered PPAR LBD or PPAR J -LBD - mi!ftant 

polypeptide. By "analog" is intended that a DNA or polypeptide sequence can 
contain altenatipnsTelative tor the sequences disclosed herein, yet retain all or 
sqme^^ of those sequences. 

15 Analogs can be derived from genomic nucleptideifsequences or from other 
organisms, ; on can be created synthetically. ; Those of skill in the art will 
appreciate that other analogs^ as yet undisclosed or undiscovered; can be 
used to design and/or construct PPAR LBD- or PPAR LBD mutant analogs. 
There is nor need for an engineered PPAR LBD or PPAR LBD mutant 

20 polypeptide to comprise all or substantially all of the amino acid sequence of 
SEQ ID NOs:2 or 4. Shorter or longer sequences are anticipated to be of use 
in the invention; shorter sequences are herein referred to as "segments". 
) Thus, the terms "engineered PPAR LBD" and "PPAR LBD mutant" also 

includes fusion, chimeric or recombinant engineered PPAR LBD or PPAR 

25 LBD mutant polypeptides and proteins comprising sequences of the present 
invention. Methods of preparing such proteins are disclosed herein above 
and are known in the art. 

IX.D: Sequence Similarity and Identity 
30 As used herein, the term "substantially similar" means that a particular 

sequence varies from nucleic acid sequence of SEQ ID NOs:1 or 3, or the 
amino acid sequence of SEQ ID Nds:2 or 4 by one or more deletions, 
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substitutions, or additions, the net effect of which is to retain at least some of 
biological activity of. the naturaLgene, gene product, or sequence. Such 
sequences include "mutant" or "polymorphic" sequences, or sequences in 
which the biological activity and/or the physical properties are altered to some 
5 degree but retains at least some or /an enhanced degree of the original 
^biological activity and/or physical properties. In determining nucleic acid 
sequences, all subject nucleic acid sequences capable of encoding 
substantially similar amino acid sequences are considered to be substantially 
similar to a reference nucleic acid sequence, regardless of differences in 
10 codon sequences or substitution of equivalent amino acids ^ to create 
biologically functional equivalents. : — • : . 

h , ^ IX.D.1. Sequences -That ^are -Substantially -Identical to an 
r % : > ■ Engineered PPAR or PPAR LBD Mutant Sequence of the 

15 - r Present Invention : , . -i - * •-• < 

he Islucleic acids that are substantially identical to a nucleic acid sequence 
of an engineered PPAR or PPAR LBD mutant of the present invention, e.g. 
allelic variants, genetically altered versions d the gene, etc., bind to an 
engineered PPAR or PPAR LBD mutant sequence : under . stringent 

20 hybridization conditions. By using probes, particularly labeled probes of DNA 
sequences, one can isolate homologous or related genes. The source of 
homologous genes can be any species, e.g. primate species; rodents, such 
as rats and mice, canines, felines, bovines, equines, yeast, nematodes, etc. 
Between mammalian species, e.g. human and mouse, homologs have 

25 substantial sequence similarity, i.e. at least 75% sequence identity between 
nucleotide sequences. Sequence similarity is calculated based on a 
reference sequence, which can be a subset of a larger sequence, such as a 
conserved motif, coding region, flanking region, etc. A reference sequence 
will usually be at least about 18 nt long, more usually at least about 30 nt long, 

30 and can extend to the complete sequence that is being compared. Algorithms 
for sequence analysis are known in the art, such as BLAST, described in 
Altschul et al„ (1 990) J. Mol. Biol. 21 5: 403-1 0. 
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Percent identity or percent similarity of a DNA of peptide sequence.can 
be. determined, 5 for example, by comparing sequence information using ihe 
'GAP computer program, 1 available from the Univereity of Wisconsin Geneticist 
cComputer^Group.-n-'Th^ GAP program utilizes %e alignment' method of 
5 v Needlemarvet al: , (1970). J. Mob Biol. 48: 443, as 'revised by' iSmith e et al. , 
(1981) Adv: Apply Math: 2:482: Briefly, the GiAP program defirte^ similarity as 
the number of aligned symbols (i.e.y nucleotides or amino acids) which are 
similar,, divided by the total number of symbois in the shorter of the two 
sequences:^ The preferred parameters for the- GAP program are the^default 
1 0 . parameters^ which do not impose a penalty for^end gaps; - See, e.g. , Schwartz 
} ' et ' a| >r ; eds- 9 o(1 979), - Atlas of Protein : SequericeQarid Stnjrture t nr Natidna 

Biomedical Research Foundation , pp. 357-358, and Gribskov et ah , (1 986) 
Nucl; Acids: Res; 14: 6745; ^ ,03*X* ^'nc^-'n^vn d *rroiro*--3i :^;;rr- 

vi^i ^••x;^The r term ^slmila'ri^ls contrasted with the term-Identity". Similarity is 

15 defined as above; "identity", however, means a nucleic acid or amiriaacid 
sequence having the same amino acid at the sarhe reliative position iri a given 
family^membep of a gene family. Homology arid similarity are generally 
viewed as broader terms than the term identity. Biochemically similar amino 
acids,' for example leucine/Isoleucine or glutamate/aspartate, can be present 

20 at the same position— -these are not identical per se, but are biocherhically 
"similar." As disclosed herein, these are referred to as conservative 
differences or conservative substitutions. This differs from a conservative 
) mutation at the DNA: level, which changes the nucleotide sequence without 

making a change in the encoded amino acid, e.g. TCC to TCA, both of which 

25 encode serine. > : . 

As used herein, DNA analog sequences are "substantially identical" to 
specific DNA sequences disclosed herein if: (a) the DNA analog sequence is 
derived from coding regions of the nucleic acid sequence shown in SEQ ID 
NOs:1 or 3; or (b) the DNA analog sequence is capable of hybridization with 

30 DNA sequences of (a) under stringent conditions and which encode a 
biologically active PPARa or PPARa LBD gene product; or (c) the DNA 
sequences are degenerate as a result of alternative genetic code to the DNA 
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analog sequences defined in (a) and/or (b). Substantially identical analog 
proteins and nucleic acids will have between about, 70% and 80%, preferably 
between about 81% to about 90% or even more, preferably between about 
91% and 99% sequence identity with the corresponding sequence of the 
native protein or nucleic acid. Sequences having lesser degrees of identity 
but comparable biological activity are considered to be equivalents. 

As used herein, "stringent conditions" means conditions of high 
stringency, for example 6XSSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% 
bovine serum albumin, 0.1% sodium dodecyl sulfate, 100 |ug/ml salmon sperm 

. DNA and 15% formamide at 68°C. For the purposes of specifying additional 
conditions of high stringency, preferred conditions are salt concentration of 
about 200 mM and temperature of about 45°C, One example of such 
stringent conditions is hybridization at 4XSSC, at 65°C, followed by a washing 
in 0.1XSSC at 65°C for one hour. Another exemplary stringent hybridization 

^scheme uses 50% formamide, 4XSSC at 42°C. ^ ; - ; . 

In contrast, nucleic acids having sequence similarity are detected by 
hybridization under lower stringency, conditions. Thus, sequence identity can 
be :1 determined by hybridization under lower stringency- conditions, for 
example, at 50° C or higher and 0.1XSSC (9 mM NaCI/0.9 mM sodium citrate) 
and. the sequences will remain bound when subjected to washing at 55°C in 
1XSSC. 

IX.P.2. Complementarity and Hybridization to an Engineered 
PPARa or PPARa LBD Mutant Sequence 

As used herein, the term "complementary sequences" means nucleic 
acid sequences which are base-paired according to the standard Watson- 
Crick complementarity rules. The present invention also encompasses the 
use of nucleotide segments that are complementary to the sequences of the 
present invention. 

Hybridization can also be used for assessing complementary 
sequences and/or isolating complementary nucleotide sequences. . As 
discussed above, nucleic acid hybridization will be affected by such conditions 
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as salt concentration, temperature, or organic solvents, in addition to the base 
.composition,, length of the complementary istrands, and the number of 
nucleotide base mismatches between the hybridizing nucleic acids; as will be 
: readily appreciated by those , skilled in* thenar! Stringent temperature 
5 conditions will generally include temperatures in excess of about 30°C, 
typically in excess of about 37°C, and preferably in excess of about 45°C. 
Stringent salt conditions will ordinarily be less than about 1 ,000 mM, typically 
• e ?.s than about 500 mM, and preferably less than about 200 mMv However, 
the combination. of parameters is much more important than the measure of 
10 any single parameter. See, e.g. , Wetmur & Davidson , (1968) J. MoL Biol. 31: 
) 349-70. ' Determining appropriate hybridization conditions to identify and/or 

isolate sequences containing high levels of homology is well known in the art. 
See, e.g. , Sambrook et al , ;(1 989) Molecular Cloning: A • Laboratory -Manual , 
Cold Spring- Harbor,, New* York.. / . : • -n- ;,>K\ ^-i-o^s; on & ^*\Jveio 
15 :. IX;D.3. : Functional > Equivalents of an ' [Engineeredj PPARa or 

PPARa LBD Mutant Nucleic Acid Sequence - of the 
Present Invention - ^ ^ ^ - . .:„..,..<.,- 
As used herein, the term "functionally equivalent cbddn" is used to refer 
to codoris that encode the same amino acid, such as the ACG and AGU 
20 codons for serine. PPARa or PPARa LBD^encoding nucleic acid sequences 
comprising SEQ ID NOs:1 and 3 which have functionally' equivalent colons 
are covered by the present Invention. Thus, when referring to the sequence 
example presented in SEQ ID Nds:1' and 3, applicants contemplate 
substitution of functionally equivalent codons into the sequence example of 
25 SEQ ID NOs:1 and 3. Thus, applicants are in possession of amino acid and 
nucleic acids sequences which include such substitutions but which are not 
set forth herein in their entirety for convenience. 

It will also be understood by those of skill In the art that amino acid and 
nucleic acid sequences can include additional residues, such as additional N- 
30 or C-terminal amino acids or 5* or 3' nucleic acid sequences, and yet still be 
essentially as set forth in one of the sequences disclosed herein, so long as 
the sequence retains biological protein activity where polypeptide expression 
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is concerned. The addition of terminal sequences particularly applies to 
nucleic acid sequences which can, for example, include various non-coding 
sequences flanking either of the ff or 3* portions of the coding region or can 
include various internal sequences, i.e., introns, which are known to occur 
within genes. 

1X.D.4. Biological Equivalents 

The present invention envisions and includes biological equivalents of 
a engineered PPAR or PPAR LBD mutant polypeptide of the present 
invention. The term "biological equivalent" refers to proteins having amino 
acid sequences which are substantially identical to the amino acid sequence 
of an engineered PPAR LBD mutant of the present invention and which are 
capable of exerting a biological effect in that they are capable of binding small 
molecules or cross-reacting with anti- PPAR or PPAR LBD mutant antibodies 
raised against an engineered mutant PPAR or PPAR LBD polypeptide of the 
present invention. ; 

For example, certain amino acids can be substituted for other amino 
acids in a protein structure without appreciable loss of interactive capacity 
with, for example, structures in the nucleus of a cell. Since it is the interactive 
capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein 
sequence (or the nucleic acid sequence encoding it) to obtain a protein with 
the same, enhanced, or antagonistic properties. Such properties can be 
achieved by interaction with the normal targets of the protein, but this need 
not be the case, and the biological activity of the invention is not limited to a 
particular mechanism of action. It is thus in accordance with the present 
invention that various changes can be made in the amino acid sequence of an 
engineered PPAR or PPAR LBD mutant polypeptide of the present invention 
or its underlying nucleic acid sequence without appreciable loss of biological 
utility or activity. 

Biologically equivalent polypeptides, as used herein, are polypeptides 
in which certain, but not most or all, of the amino acids can be substituted. 
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Thus,when referring to the sequence examples presented in SEQ ID NOs:1 
and - 3, applicants - envision substitution of codons tKaf ; enrocle biologically 
equivalent amino acids, as described herein, into the £ sequence example of 
SEQ ID NOs:2 and 4, respectively; 3 ThusT applica^ possession of 

5 amino acid and nucleic acids sequences which include such substitutions but 
which are not set forth herein in their entirety for convenience. 0 * 

■'•!■ ^Alternatively^ functionally equivalent" proteins 1 or^peptides" can be 
created via the application of recombinant DNA technology; in which changes 
in the protein structure can be engineered, based on f <X)hsiderations of the 

1 0 : properties of the'amirio acidsi being exchanged, e.g: f substitution of lie for Leu. 
} Changes designed by man can 6e- introduced through the appiicatioh of site- 

directed ^mutagenesis techniques,- e.g., : v to introduce" iifiiprovements ^to ' "the 
antigenicity bf - the protein- o'r' to ; fest an fehgirifeered' PPAR 1 dr ^PAR LBD 
; mutant polypeptide of the^preserif invention in 'order td'modulate lipid^bihding 

15 .'brother acthntyV&t 'the'rtiolecula^level.' J ; ' ^ ^: --- : -* . ^ — 
;.r>: t; Amino acid substitutions? such -as those which might bd 'ferhployed in 
modifying an engineered PPAR or PPAR LBD ' mutant "polypeptide of the 
" present invention are generally, but not necessarily, based on the relative 
^similarity of the amino acid side-chain^ f substituerits,' for example, their 

20 hydrophobicity, hydrophilfcity, charge, size; and the like. 4 " ,J An ^analysis of the . 
size/ shape and type of the amino acid side^hain' substituents reveals that 
arginine, lysine and histidine are all positively charged residues; that alanine, 
) glycine and serine are all of similar size; arid that phenylalanine, tryptophan 

and tyrosine all have a gerierally similar shape. Therefore, based upon these 

25 considerations, arginine, lysine and histidine; alanine, glycine and serine; and 
phenylalanine, tryptophan and tyrosine; are defined herein as biologically 
functional equivalents. Other biologically functionally equivalent changes will 
be appreciated by those of skill in the art. it is implicit in the above discussion, 
however, that one of skill in the art can appreciate that a radical, rather than a 

30 conservative substitution is warranted in a given situation. Non-cbhservative 
substitutions in engineered mutant PPAR or PPAR LBD polypeptides of the 
present invention are also ah aspect of the present invention. 
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In making biologically functional equivalent amino acid substitutions, 
the hydropathic index of amino acids can be considered. Each amino acid 
has been assigned a hydropathic index on the basis of their hydrophobicity 
and charge characteristics, these are: isoleucine (+ 4.5);, valine (+ 4.2); 
5 leucine (+ 3.8); phenylalanine (+. 2.8); cysteine (+ 2.5); methionine (+ 1.9); 
alanine (+1.8); glycine (.0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); 
tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (- 
3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

The importance of the hydropathic amino acid, index in conferring 

10 interactive biological function on a protein is generally understood in the art 

(Kyte & Doolittte . (1982),. J. Mol. Biol. 157: ,105-132, incorporated herein: by ) 
reference). It is known that certain amino acids can be substituted for other 
.amino acids having, a similar hydropathic index or score and still retain a 
.similar^ making changes based upon . the hydropathic 

15 index, the substitution of amino acids whose. hydropathic indices are within ±2 
of the original value is preferred, those which are within ±1 of the original 
value are particularly preferred, and those within ±0.5 of the original value are 
even more particularly preferred. , v . 

It is also understood jn the art that the substitution of like amino acids 

20 can be made effectively on the basis > of hydrophilicity. U.S. Patent No. 
4,554,101, incorporated herein by reference, states that the greatest local 
average hydrophilicity of a protein, as governed by the hydrophilicity of its 
adjacent amino apids, correlates with its immunogenicity and antigenicity, i.e. J) 
with a biological property of the protein., It is understood that an amino acid 

25 can be substituted for another having a similar hydrophilicity value and still 
obtain a biologically equivalent protein. 

As detailed in U.S. Patent No. 4,554,101, the following hydrophilicity 
values have been assigned to amino acid residues: arginine (+ 3.0); lysine (+ 
3.0); aspartate (+ 3.0±1); glutamate (+ 3.0±1); serine (+ 0.3); asparagine (+ 

30 0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (-0.5+1); alanine (- 
0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (- 
1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 
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c .Jn making changes based upon similar hydrophilicity vak/es, the 
substitution of amino, acids whose- hydrophilicity values- are" within ±2 of the 
original value is preferred, those which are within ±1 of the original value are 
particularly preferred, and those within ±0.5 of the original value are even 
5 more particularly preferred. " r : 

. While discussion has focused on functionally equivalent polypeptides 
arising from, amino acid changes, it will be appreciated that these changes 
can be effected by alteration of the encoding DNA, taking into consideration 
also that the genetic code is degenerate and that two or more codohs can 
10 code for the same amino acid. : ? ^ 
^) ; -.. ; Thus, lt : will also be understood that this invention is not limited to the 

: particular amino acid and nucleic • acid sequences of SEQ ID NOs:l-4. 
Recombinant vectors - and isolated - DNA segments' can therefore* variously 
include an engineered PPARa or PPARa LBD mutant "polypeptide^ncbding 
15 region itself, include' coding regions bearing selectee! 5 alterations^ or 
modifications in the basic coding 'region/or include larger ^ polypeptides which 
nevertheless comprise arv PPARd or 'PPARa LBb mutant polypepltide- 
encoding regions or can encode biologically functional equivalent proteins or 
polypeptides which have variant amino acid sequences. Biological activity of 
20 an engineered ! :f PPARa or PPARa ''lBD'' ,a 'miifant vV ^lypepOde can be 
determined, for example, by iipid-binding assayd known to those of skill in the 
art. "" r 

) The nucleic acid segments of the present invention, regardless of the 

length bf the coding sequence itself, can be combined with other DNA 

25 sequences, such as promoters;, enhancers, poiyadenylation signals, additional 
restriction enzyme sites, multiple cloning sites, other coding segments, and 
the like, such that their overall length can vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length can be 
employed, with the total length preferably being limited by the ease of 

30 preparation and use in the intended recombinant DNA protocol. For example, 
nucleic acid fragments can be prepared which include a short stretch 
complementary to a nucleic acid sequence set forth in SEQ ID NOs:1 and 3, 
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such as about 10 nucleotides, and which are up to 10,000 or 5,000 base pairs 
in length. DNA segments with total lengths of about 4,000,. 3,000, 2,000, 
1 ,000, 500, 200, 100, and about 50 base pairs in length are also useful. . 

The DNA segments of the present invention encompass biologically 
5 functional equivalents of engineered PPAR or PPAR LBD . mutant 
polypeptides. Such sequences can rise as a consequence of codon 
redundancy and functional equivalency that are known to occur naturally 
within nucleic acid sequences and the proteins thus encoded. . Alternatively, 
functionally equivalent proteins or polypeptides can be created via the 

10 application of recombinant DNA technology, in which changes in the protein 

structure can be engineered, based on considerations of the properties of the 1 ^) 

amino acids being exchanged. Changes can be introduced through the 
application of site-directed mutagenesis techniques, e.g., to introduce 
improvements to the , antigenicity of the protein ~ or. to test variants of an 

15 engineered PPAR or PPAR LBD mutsmt of the present invention in order to 
examine the degree of lipid-binding activity, or other activity at the. molecular 
level. Various site-directed mutagenesis techniques are known to those of 
skill in the art and can be employed in the present invention. 

The Invention further encompasses fusion proteins and peptides 

20 wherein an engineered PPAR or PPAR LBD mutant coding . region of the 
present invention is aligned within the same expression unit with other 
proteins or peptides having desired functions, such as for purification or 
immunodetection purposes. ^ 
Recombinant vectors form important further aspects of the present 

25 invention. Particularly useful vectors are those in which the coding portion of 
the DNA segment is positioned under the control of a promoter. The promoter 
can be that naturally associated with a PPAR gene, as can be obtained by 
isolating the 5' non-coding sequences located upstream of the coding 
segment or exon, for example, using recombinant cloning and/or PCR 

30 technology and/or other methods known in the art, in conjunction with the 
compositions disclosed herein. 
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jn other embodiments, certain advantages will be gained by positioning 
the coding DNA segment under the control of a recbmbinanVor heterologous, 
promoter. As used herein, a recombinant or heterologous promoter is a 
promoter that is not normally associated with a PPAR gene in its natural 
5 environment. Such promoters can include promoters isolated from bacterial, 
viral, eukaryotic, or mammalian cells. Naturally, it will be important to employ 
a promoter that effectively directs the expression of the DNA segment in the 
cell type chosen for expression. The use of promoter and cell type 
combinations for protein expression is generally known to those of skill in the 

1 0 art of mplecular ; biology (See, e.g. , Sambrooket al. , (1989) Molecular Cloning: 
A Laboratory Manual , Cold Spring Harbor, Laboratory, New York, specifically 
incorporated herein^j by .reference). r The : promoters employed can be 
. constitutive or inducible and.can be used under the appropriate conditions to 
direct - high level .expression of -the introduced DNA segment, such as is 

15 advantageous in the large-scale production of recombinant proteins or 
peptides. One preferred prompter system contemplated for use in high-level 
expression is a T7 promoter-based system. 



20 . . j , : : v 

X The Role of the Three-Dimensional Structure of the PPARa LDB in 
Solving Additional PPAR Crystals 
) Because polypeptides can, crystallize in more than one crystal form, the 

structural coordinates of a PPARa LBD, or portions thereof, as provided by 

25 the present invention, are particularly useful in solving the structure of other 
crystal forms of PPARa and the crystalline forms of other PPARs. The 
coordinates provided in the present invention can also be used to solve the 
structure of PPAR or PPAR LBD mutants (such as those described in Section 
IX above), PPAR LDB co-complexes, or of the crystalline form of any other 

30 protein with significant amino acid sequence homology to any functional 
domain of PPAR. 
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X.A. Determining the Three-Dimensional Structure of a Polypeptide 
Using the Three-Dimensional Structure of the PPARa LBD as a 
Template in Molecular Replacement 
, One method that can be employed for the purpose of solving additional 
5 PPAR crystal structures is molecular replacement See generally , Rossmann , 
ed, (1972) The Molecular Replacement Method , Gordon & Breach, New York. 
In the molecular replacement method, the unknown crystal structure, whether 
it is another crystal form of a PPARa or a PPARa LBD, (i.e. a PPARa or a 
PPARa LBD mutant), or a PPARa or a PPARa LBD polypeptide complexed 
10 with another compound (a "co-complex"), or the crystal of some other protein 
with significant amino acid sequence homology to any functional region of the 
PPARa LBD, can be determined using the PPARa LBD structure coordinates 
provided in Table 2. This method provides aft accurate ^structural fonfnforthe 
unknown crystal more quickly -and efficiently than attempting to determine 
15 such information ab initio. - > ^ . r ^ 

In addition/ in accordance with this invention, PPARa or PPARa LBD 
mutants can be crystallized in complex with known modulators. TKe crystal 
structures of a series of such complexes can then be solved by molecular 
replacement and compared with that of wild-type PPARa or the wild-type 
20 PPARa LBD. Potential sites for modification within the various binding sites 
of the enzyme can thus be identified. This information provides an additional 
tool for determining the most efficient binding interactions, for example, 
increased hydrophobic interactions, between the PPARa LBD and a chemical 
entity or compound. 

25 AH of the complexes referred to in the present disclosure can be 

studied using X-ray diffraction techniques (See, e.g. , Blundell & Johnson 
(1985) Method.EnzymoL, 114A & 115B, (Wyckoff et al. , eds.), Academic 
Press) and can be refined using computer software, such as the X-PLOR™ 
program (BrQnger , (1992) X-PLOR, Version 3.1. A System for X-ray 

30 Crystallography and NMR, Yale University Press, New Haven, Connecticut; 
X-PLOR is available from Molecular Simulations, Inc., San Diego, California). 
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Thisiinformatioh can thus ber used to optimize known classes *of PPAR and 
; PPAR.LBD modulatorsy:and:more importantly, tb^desigh arid synthesize hovel 
classes of PPAR and PPAR LBD modulatory. -..«;•--. ^ '- ' ^ 1 

,^--:C; *• Y •.vO^.:-,- - n:*^; . ^r-> - : - .:■ * 

5 « . , : h — "Laboratory Examples 

1 the follbwirfg included to illustrate 

preferred mbdes^of the 7 inVerrtibrr: : Certain aspects of th^ fbllbwihg Laboratory 
Examples are deiscribed - in terms of techniques iand procedures found or 
contemplated by the present inventors to ' work v ^ 
10 'invention:' ^ through ^the use of 

) standard- labbraioiy ;i ^ r inventors^ in 'light ^ oK r the ' present 

disclbsui^ahd ; thef general fevel of skill in the akl^hbs^of skill will appreciate 
that the following laboratory ^ ■Example's are intehdecl to be exemplary oniyand 
that - numerous- cha^^ ' arid c ; aiterWtibhs can be J ^empibyed 

15 withoutldepartng'ffeWthe 'spirit arid scope of the invention. c ~ ' - y 

, : h r rv;rn ^- f ». ^Laboratory Example 1 " u ? '•■'^ 'V' ;: 

i.r; .;'^-v:) ; rc«V-\ ; Protein Preparation r ^ ; \om-vo\ 

: - -The nucleic acid sequence encoding thb PPARa ligarid binding domain 
20 (amino acids 192-468) tagged with MKKGHHHHHHG (SEQ ID NO: 9) T was 
operatively linked to 7 and expressed using theT7 promoter of plasmid vector 
pRSETA. BL21(DE3) E. coli cells transformed with this expression vector 
) were grown at 24°C in shaker flasks for 66 hours on 2xYT medium (16 g/L 

Bacto-Tryptone, 10 g/L yeast extract, 5 g/L NaCI, QC with distilled water) with 
25 50 mg/L carbenicillin to an ODeoo of approximately 9.0. The cells were 
harvested, resuspended with 20 ml extract Buffer (20 mM HEPES, pH 7.5, 50 
mM imidozle, 250tmM NaCI and a pinch of lysozyme) per liter of cells arid 
were Jysed: by sonication for 20 minutes on iceV The lysed < cells were 
centrifuged at 40,000g for 4Q minutes, and the supernatant was loaded on a 
30 100 ml Ni-agarose column. r - \ -i? 

The column was washed with 150 ml Buffer A (10% glycerol, 20 mM 
HEPES pH 7.5, 25 mM imidizole) and the protein was eluted with a 450 ml 
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gradient of Buffer B (10% glycerol, 20 mM HEPES pH 7.5, 500 mM imidozle). 
The protein, which eluted at 20% Buffer B, was diluted with one volume of 
Buffer C (20 mM HEPES, pH 7.5, 1 mM EDTA), and loaded on an 100 ml S- 
Sepharose™ (Pharmacia, Peapack, New Jersey) column. The column was 
5 washed with a 100 ml Buffer C and the PPARa LBD protein was eluted with a 
2 9° ml gradient, of Buffer D. (20 mM HEPES, pH. 7.5, 10 mM DTT, 1 M 
ammonium acetate). The PPAR alpha LBD eluted from the column at 43% 
Buffer D^JT^.P^eip y! 6 '^^ 3 ?,.? mgl L of cells grown. and was >95% .pure, 
as determined by SDS-PAGE analysis..., , . , 

10 The protein was then diluted to ,1„mg/ml with Buffer C such that the 

,f»naf .buffer composition was 220 mM ammonium acetate, 2Q mM HEPES pH ) 
.7.5, 1 mM EDTA and 1 mM DTT. > The diluted protein was aliquoted into 9 ml 
aliquote, flashy frozen with liqujd nitrogen , and stored .at^OfjC. To prepare 
\^™Plf*?$an (ndividuaLaliquot 

15 SRC1 (See, XuetaL (1999X Mol, Cell 3: 397-403) vyas added in a mol ratio of 
1.5 as a 2mg/ 100 DMSO stock. The peptide was then added in a mol ratio 
of 5:1 as a 2mg/100 jil DMSOstock and spun at 4K for 20 min to clarify the 
solution before concentrating jn . Centriprep™ 30 filtration units (Millipore, 
Bedford, :: MasrachuTCtt8)/\ .The solution -containing the PPARa LBD-SRC1 

20 complexes was concentrated to approximately 10 mg/ml with 80% yield. The ; 
complexes were then aliquoted in single use aliquots of 30 pi, flash frozen in 
liquid nitrogen and stored at -80°C. ' 

4 Laboratory Example 2 

25 Crystallization and Data Collection 

The crystals disclosed in this invention . were grown at room 
temperature using the hanging drop vapor diffusion method. The hanging 
drops comprised 1 jil of the above protein-Iigand solutions, and were mixed 
with an equal volume (1 jil) of well buffer comprising 7% PEG 3350, 200 mM 
30 NaF, and 12% 2,5 hexanediol. 
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Before data collection, crystals were transiently mixed with well buffer 
that contained an additional' 10% : hexanediol as a cryoprotectant, and then 
flash frozen in liquid nitrogen. ' > ' ; • 

The. PPARa crystals formed in this P2i2i2 space group, wrth a=61.3 A, 
5 b = 103.5A, c = 49.9A; Each asymmetry unit contained a single PPARa LBD 
with 45% solvent content. Crystals contained Compound 1, the PPARd LBD 
and SRC1 peptide, in a ; ratio of 1 :1 :1 . Data were collected With a Rlgaku R- 
Axis II (Rigaku, Tokyo, Japan) detector in house, or with a MAR CCD detector 
in the IMCA 17ID beam line at the Argonne National Laboratory (Argonne, 
10 Illinois), and the observed reflections were reduced, merged and scaled with 
DEfSfZO™ and SCALEPACK™ fn the' HKL2000 package I fbtwinowski , (1993) 
in Proceedings of the CCP4 Study Weekend: Date [Cblle^jon arid Processing . 
(Sawyer r et al/ , eds.), pp.56-6^rSERC Dardsbury 

15 Laboratory Exanfjjjle 3 

r Structure Determination arid Refinement 5 
The structure was determined by rholecular replacement methodiwith 
the CCP4 AmoRe program (Collaborative Computational Project Number 4, 
1994; Navaza, (1994) Acta. Crysi. A50: 157-163) using the structure 
20 coordinates for the PPAR8 LBD (Xu et al. , (1999) Mbl. Cell 3: 397-403), 
residues 167-441 , as the initial model (T able 3). The best fitting solution gave 
a correlation coefficiency of 70% and an R-factor of 33%. Model building was 
J performed with the software program QUANTA™, and structure refinement 

was achieved using the CNS software program (Brunger et al , (1998) Acta. 
25 Crystallogr. D54: 905-921). Structure refinement involved multiple cycles of 
manual rebuilding. The final structure (Table 2 and Figures 1, 2, 4 and 7) 
includes one PPARa LBD, one SRC1 peptide and Compound 1. The 
statistics of the structures are summarized in Table 1. 
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Laboratory Example 4 
w ... ; <. Computational Analysis * ;. 

Surface areas were calculated using both, the Connolly MS program 
(Connolly , (1983) Science 221: 709-713) and the MVP program (Lambert , 
(1997) in Practical Application of Computer-Aided Drug Design , (Charifson. 
ed.), pp. 243-303, Marcel-Dekker, New York). The C2-symmetry axis, 
sequence alignments and binding site accessible waters were calculated 
using the software program MVP. 
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TABLE 1 

PARAMETERS FOR THE CRYSTALLINE FORM OF THE LIGAND BINDING 
DOMAIN (RESIDUES 207-441 ) OF PPARoc IN COMPLEX WITH 

COMPOUND 1 



X-ray Source 


Glaxo Facility 


IMCA 


Space Group 




P2i2i2 


Resolution (A) 


20.0- 2.4 


20.0-1.8 


Unique Reflections (N) 


64,772 


30,147 


Completeness (%) 


95.2 


99.4 


l/g(last shell) 


25.4 (3.3) ~ 


39.5 (5.2) 


Rsym 3 (%) 


9.4 


4.5 


Refinement Statistics 






R factor* (%)(2a) 




22.9 


R free 0 (%)(2ct) 




26.9 


R.M.S.D. Bond 






Lengths (A) 




0.010 


R.M.S.D. Bond 






Angles (degrees) 




1.357 


Number of H 2 0 Molecules 




396 


Total Non-hydrogen Atoms 




2659 



r.m.s.d. is the root mean square deviation from Ideal geometry. 

a Rsym = Z I 'avg - I/ | / Z '/ 

b Rfactor = Z I Fp • Fpcaic I / Z f p « where Fp and F pca ic are observed and 
1 0 calculated structure factors, respectively 

C R free is calculated from a randomly chosen 10% of reflections that have 
never been used in refinement, and R factor is calculated for the remaining 90% 
of reflections. 
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TABLE 2 

. ATOMIC STRUCTURE COORDINATE DATA OBTAINED FROM X-RAY 
DIFRACTION FROM THE LIGAND BINDING DOMAIN OF PPARa in 

coMPijExwrracoupbuNb i > 



ATOM 


ATOM 
TYPE 


RESIDUE 

i 


**• 

PROTEINS 


- # 


X 


Y 


Z 


pee 


B 


1 


CB 


ASP 


A 


202 


14.533 


5^75 


19.857 


1.00 


78.46 


2 


CG : 


•ASP 


A 


202 


14.270 


4.099 


20.079 


1.00 


79.53 




ODL 


ASP 


-A ■ 


202 


' 13.370 


3.547 


19.410 


1.00 


80.23 


4 : . 


OD2 


ASP , 


i A o 


202 


14.963 


31492 


20.921 


1.00 


80.14 


• .-; 5 .,- 


C 


i ASP 


A 


202 


15.243 


7.375 


18.274 


1.00 


76.33 


6 


O 


'-ASP < 


"-A 


202 


14.377 


8.174 


17.918 


.1.00 


76.50 


7 


N 


- ASP ; 




202 


16.101 


5.066 


18.013 


1.00 


77.36 


• 8 


GA . s 


ASP : 


A •; 


202 


14.925 


5.890 


18.412 


1.00 


77.08 


9 


N. 


* LEU 


'A' - 


203 


16.488 


7.739 


18.565 


1.00 


74.80 


- 10 


CA 


' LEU 


A 


203 


16.927 


9:125 


18.460 


1.00 


72.96 


,11 


CB 


LEU 


A 


203 


18.150 


9378 


19.344 


1.00 


73.58 


12 : 


CG 


LEU 


A * 


203 


18.821 


10.740 


19.132 


1.00 


73.64 


.13 


CDl 


LEU 


A 


203 


18.061 


11.815 


19:895 


1.00 


73.44 


14 


CD2 


LEU . 


A 


203 


20.265 


10.680 


19.597 


1.00 


73.65 


15 


C 


v LEU 


A 


203 


17.304 


9.418 


17.019 


1.00 


71.55 


16 : 


O 


^LEU 


A 


203 


17.251 


10.561 


16.574 


1.00 


71.54 


17 


N' 


LYS 


A -• 


204 


17.694 , 


8.379 


16.292 


1.00 


69.51 


18 


CA 


LYS 


A 


204 


18.088 


8.548 


14.905 


1.00 


67.63 


19 


CB 


LYS 


A 


204 


18.876 


7.324 


14.439 


1.00 


68.39 


20 


CG 


LYS 


A 


204 


19.491 


7.454 


13.057 


1.00 


68.94 


21 


CD 


LYS 


A 


204 


20.927 


6.946 


13.047 


1.00 


69.41 


22 


CE 


LYS 


A 


204 


21.072 


5.632 


13.805 


1.00 


69.58 


23 


NZ 


LYS 


A 


204 


22.447 


5.088 


13.716 


1.00 


70.30 


24 


C 


LYS 


A 


204 


16.866 


8.787 


14.025 


1.00 


65.91 


25 


o 


LYS 


A 


204 


16.995 


9.160 


12.858 


1.00 


66.04 


26 


N 


SER 


A 


205 


15.680 


8.574 


14.590 


1.00 


62.82 
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27 


CA 


SER 


A 


205 


14 445 


R R14 


1 0 R54 


i fin 


jy.lo 


28 


CB 


SER 


A • 


205 


, 13.240 


R 959 




i fin 

l.UU 


01/. OD 


29 


OG 


SER 


A 


905 


19 Q04 




1 ^ CAT 


1 AA 
l.UU 


59.4/ 


30 


c 


SER 


A 




14 Hfi 


1U.33 1 


13. /Jl 


1 AA 
1.00 


cc on 
56.82 


31 


o 


SER 


A 
A 




1 ^ 405 


lU.oOl 


13.016 


1 AA 
1.00 


56.27 


32 


N 


T FIT 


I A 




ID. iOO 


1 1 A1 G 

11.015 


1 A C 1 A 

14.510 


l.OO 


53.62 


33 


CA 

> — 'Xx. 


T PTT 


A 




1 C 1/f A 

. 1 0.Z4U . 


12.473 


14.540 


l.OO 


50.75 


34 


PR 


T PTT 


i A 
A 


206 


IoJoz 


12.935 


15.477 


l.OO 


50.59 


35 


V^VJ 


T PTT 


: A 


20 o 


. 1 o.72o 


1 A A*%1 

14.421 


15.542 


l.OO 


49.94 


36 




" T PTT 


A 


ZUo 


.15.5 /3 


15.224 


16.114 


LOO 


49.18 


37 


- pn? 


- T PT T 


A 


zoo 


- 17.972 


14.595 


16.403 


LOO 


49.63 


* 38 


c 


T PTT 


A 
A 


ZUO 


1 5.45*0 


1 O A1 O 

13.018 


in i a i 

13.142 


1 AA 

1.00 


48.97 


39^ 


o . 


TU[I 

JuCU 


A 

A 




. 1D.U1D 


14.09o 


12.795 


1 AA 

LOO 


49.11 


40 • 




AT A 


. -■ A 

: (■ a. 


OAT 


1 /C Old 

..io.Z3y 


1 1 UT1 

12.262 


12*344 


LOO 


46.47 


*41 


PA =~ 


- AT A — 


■ A - 

v ■: .A. ■ m 


OAT 

20/ 


16.555 


12.671 


i a noo 

10.983 


LOO 


45.03 


42 




-AT A 
AuLA 


A ' 


207 


17.428 


11.622 


10.313 


LOO 


44.87 


4^ 


pi 


"-'AT A 
AL.A 


A 

' A 


O A*7 

20/ 


15.291 


12.907 


10.157 


1 r\f\ 

1.00 


44.53 




r\ 


AT A 

. ALA. s 


A 


207 


15.077 


14.006 


9.646 


LOO 


43.47 


45 


"NT - 
■ IN 


T VQ 
, Lio 




ZOo 


14.454 


11 001 

11.881 


1 a Aon 

10.029 


1 AA 

1.00 


43.76 


46 


V^A l 


JL x o 


A 

A 


208 


13.226 


i a rvi 1 

12.011 


9.251 


1 t\f\ 

LOO 


42.20 


47 




T VQ 


- A 

••A v 




1 A /f"71 

12.471 


10.680 


A 1 Ail 

9.194 


1 A A 

LOO 


44.47 


48 - 

to 


PIT- 


T VQ 


A 

■ A 


OAQ 

ZOo 


11.53/ 


1 A C7ZT 

10.576 


*7 AAA 

7.990 


i t\r\ 
LOO 


45.43 


49 


PT> 


T VQ 


A 


ZUo 


10. /ID 


A "2 a A 
7JUU 


*7 AAA 


1 AA 
1.00 


4o.y2 


50 


CE 


T VQ 


A 

A 


90S 


7.00U 


Q 098 




1 fifi 
l.UU 


45 AO 


51 




LYS 


A 


90S 


O. / yy 


R 1 14 


0 050 


1 00 

I .uU 


40 94 


52 


c 


LYS 


A 


208 


12.330 


13 086 


9 852 


1 00 


41 10 


53 


o 


LYS 


A 


208 


11.658 


13.818 


9.128 


1.00 


40.18 


54 


N 


ARG 


A 


209 


12.328 


13.174 


11.179 


1.00 


38.94 


55 


CA | 


ARG 


A 


209 


11.535 


14.172 


11.887 


1.00 


37.26 


56 


CB 


ARG 


. A 


209 


11.745 


14.022 


13398 


1.00 


39.88 


5 7 


CG 


ARG 


A 


209 


11.394 


15.257 


14.226 


1.00 


43.03 


58 . 


CD 


ARG 


A 


209 


9.901 


15.543 


14.240 


1.00 


45.71 


59 


NE 


ARG 


A 


209 


9.622 


16.898 


14.711 


1.00 


46.98 
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:. ;60 


CZ 


ARG; 


< A , 


209 


*' 8.403 


17.416 


14.828 


1.00 


47.56 


, ;61 


NH1 


r 'ARG 


- • A, 


209 


- 7337 


16.691 


14.511 


1.00 


48.63 


:62c 


• NH2v 


ARG 


■ A 


209 


8.251 


18.662 


15.255 


1.00 


47.03 


63 


C 


ARG; 


' ' A . - 


209 


' 11.945 


15.575 


11.437 


1.00 


34.87 


64 : 


or 


"ARG: 1 


" A; 


209 


11.104 


16.388 


11.050 


1.00 


33.63 


65 f 


N . 


: • ILE 


A 


210 


13.243 


15.855 


1J.496 


1.00 


31.60 


• 66 .i 


CA 


'TLB 


•• A 


210 


13.759 


17.156 ■ 


11.086 


; 1.00 


28.45 


67 


CB 


JLE 


• A 


210 


15.283 


17.260 


. 11355 


1.00 


2735 


68 ... 


CG2 


r ILE 


A 


210 


45.873 


18.457 


10.625 


1.00 


26.86 


69 


CGI 




Mr. ;A ' •• 


210 


15.532 


17.376 


12.863 


1.00 


2732 


\ 70 


CD1 


ILE : * 


• A •■■ 


210 


17.002 


17.319 


13.251 


1.00 


26.22 


:, 71 


C 


::v: iLE ; 


A. ';■ 


210 


13.485 


17.354 


9.599 


1.00 


27.44 


. 72 


O 


o ILE « 


A. 


210 


' 13.158 


18.453 


. 9.157 


1.00 


26.71 


73.. 




TYR ; 


A ; 


211 


13.614 


16.277 


8.836 


1.00 


25.68 


74 


CA 


•TYR : 


-c A 


211 


13.370 


16335 


' 7.403 


;1.00 


26.73 


75. 


CB 


. TYR 


A; 


211 


13.782 


15.011 


6.748 


1.00 


25.17 


76 


CG 


-TYR 




211 


13.657 


14.971 


5.236 


.1.00 


27.92 


;77 


CD1 


TYR 


A " 


211 


13.862 


16.116 


4.464 


1.00 


27.10 


78 < 


■ CE1 


TYR 

J 


A 


211 


13.787 


16.071 


3.073 


1.00 


27.40 


79 


i CD2 


TYR 


■ * A 


211 


13.372 


13.775 


4.573 


1.00 


28.20 


80 • 


CE2 


TYR 


A. 


211 


13.296 


13.721 


3.180 


1.00 


29.60 


81 ; 


; CZ 


\ '•' TYR , 


• ' ' A 


211 


13.506 


14.875 


2.437 


1.00 


28.67 


82 


OH 


TYR 


A 


211 


13.446 


14.829 


1.061 


1.00 


30.22 


83 


C 


TYR 


A 


211 


11.892 


16.637 


7.152 


1.00 


2635 


84 


O 


TYR 


A 


211 


11.552 


17.402 


6.253 


1.00 


22.94 


85 


. N 


GLU 


A 


212 


11.010 


16.046 


7.951 


1.00 


27.41 


oo 


L»A 


flT TT 


A 


Z1Z 






7 77R 






87 


CB 


GLU 


A 


212 


8.760 


15.392 


8.700 


1.00 


29.37 


88 


CG 


s GLU 


A 


212 


8.870 


13.917 


8.324 


1.00 


32.72 


89 


CD 


GLU 


A 


212 


7.948 


13.014 


9.129 


1.00 


34.56 


90 


OE1 


GLU ; 


A 


212 


8.053 


12.998 


10372 


1.00 


36.22 


91 


OE2 


GLU . 


' A 


212 


7.121 


12.314 


8.510 


1.00 


36.44 


92 


C 


GLU 


A 


212 


9.297 


17.771 


8.057 


1.00 


25.89 
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93 


O 


GLU- 


. A , 


212 


8.535 


18.407 


7 331 


1 00 


25 26 


94 


N 


ALA 


-7 A",; 


213 


9.926 


18.312 


9 099 


1 00 


93 7S 


95 


CA 


^ALA 


A ~ 


213 


9 746 


1° 712 


9 461 


i on 


99 66 


96 


CB 


ALA 


A 

■Cm. 


213 


10 -594 


20 096 


10 731 

1U. I J L 


1 oo 


94 Oz 


97 


C 


-ALA 




213 


10 921 

X U.XX X 


20 609 

XU.UU7 


R 391 

OOZl 


i on 


71 6fi 
Zl. Oo 


98 


O 


- AT A ' 


A 


913 


7,010 


91 647 


ft 033 


1 AA 

l.UU 


ZU.Uo 


99 


I N 


X X XV 


A 


914 

Z1H 


i 1 1 30R 

1 l.JUO 


90 903 


7 <7£ 


1 AA 

l.UU 


OA 

zu.oy 


100 


CA 


X X XV 


A 


914 
Zl*+ 


1 1 R73 
. ll.o/3 


90 Q^f\ 
jLV.y DO 


0.0 OD 


1 AA 
l.UU 


OA O 1 


101 




XI XV 


A 


914 
Zl*t 


1 7 77A 
1 J.ZZO 
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1.00 


60.65 


256 


CB 


ALA 


A 


233 


9.187 


11529 


-18.907 


1.00 


60.60 


257 


C 


ALA 


A 


233 


8.110 


19.284 


-20 J 12 


1.00 


61.11 
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258 


O 


ALA 


A 


233 


7.648 


19.450 


-21.442 


1.00 


61.12 


259 


" N 


* SER • 


A ' 


234 


7.564 


19.799 


-19.217 


1.00 


61.07 


260 


CA 


SER ' 


A 


234 


6360 


20.609 


-19281 


1.00 


61.06 


261 


CB 


SER 


A 


234 


5.742 


20.748 


-17.885 


1.00 


61.00 


262. 


OG- 


SER • 


- A T 


234 


4.582 


21.563 


-17.917 


1.00 


59.89 


263 


C 


• SER 


A 


234 


6.687 


21.988 


-19.847 


1.00 


61.10 


.264 : 


o 


SER 


A 


234 


7344 


22.107 


-20.883 


1.00 


61.74 


265 


N 


' ASN 


A 


235 


6.230 


23.026 


-19.157 


1.00 


6020 


266 


CA 


ASN^ 


A 


235 


6.452 


24.396 


-19.590 


1.00 


59.10 


. 267 


CB 


ASN 


A 


235 


5.740 


24.614 


-20.934 


1.00 


58.65 


268 


CG 


ASN 


• : A 


235 


5.584 


26.076 


-21.296 


1.00 


58.90 


269 •■ 


OD.l 


' ASN 


■ A 


235 


4.628 


26.728 


-20.872 


1.00 


59.73 


270 


■ ND2 


-ASN: 


,. A . . 


235 


6.52) 


26.601 


-22.081 


1.00 


5839 


271 • 


C 


ASN ' 


A 


235 


5.952 


25349 


-18.502 


1.00 


58.52 


272 


; O 


ASN - 




235 


5.697 


24.917 


-17377 


1.00 


59.67 


273 


N 


ASN. ■ 


A ' \ 


236 


5.807 


26.629 


-18.840 


1.00 


56.14 


?74 


CA 


ASN 


A 


236 


5.393 


27.666 


-17.896 


1.00 


5L26 


275 - 


CB 


ASN, 


A 


236 


4.356 


27.117 


-16.912 


1.00 


52.89 


276 


CG 


ASN 


A 


236 


4.033 


28.087 


-15.799 


1.00 


52.48 


277 « 


OD1 


"ASN 


A -■■ 


236 


3.453 


29.148 


-16.028 


1.00 


52.78 


278 


ND2 


ASN 


A 


236 


4.413 


27.729 


-14.580 


1.00 


52.90 


279 | 


C 


ASN 


A 


236 


6.701 


28.005 


-17.179 


1.00 


47.02 


280 


O 


ASN 


A 


236 


6.751 


28.137 


-15.956 


1.00 


47.14 


281 


N 


PRO 


A 


237 


7.783 


28.170 


-17.966 


1.00 


42.04 


282 


CD 


PRO 


A 


237 


7.651 


28.434 


-19.410 


1.00 


41.47 


283 


CA 


PRO 


A 


237 


9.145 


28.475 


-17.529 


1.00 


38.16 


284 


CB 


PRO 


A 


237 


9.906 


28.523 


1 O- O A£. 

-18.846 


1 AA 

1.00 


io.Oo 


285 


CG 


PRO 


A 


237 


8.920 


29.198 


-19.729 


1.00 


39.97 


286 


C 


PRO 


A 


237 


9338 


29.755 


-16.740 


1.00 


34.87 


287 


O 


PRO 


A 


237 


8.615 


30.734 


-16.921 


1.00 


34.21 


288 


N 


PRO 


A 


238 


10.338 


29.760 


-15.848 


LOO 


32.12 


289 


CD 


PRO 


A 


238 


11.273 


28.667 


-15.531 


LOO 


3231 


290 


CA 


PRO 


A 


238 


10.617 


30.943 


-15.038 


LOO 


2930 
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291 


CB 


PRO 


A 


238 


11.741 


30.470 


-14.114 


1.00 


29.79 


j 292 


CG 


PRO 


A 


238 


12.436 


29.411 


-14.933 


1.00 


32.45 


293 


C 


PRO 


A 


238 


11.045 


32.080 


-15.960 


1.00 


27.36 


294 


O 


PRO 


A 


238 


11.745 


31.861 


-16.952 


1.00 


25 06 


295 


N 


PHE - 


A 


239 


10.606 


33.289 


-15.637 


1.00 


24.30 


296 


CA 


PHE 


A 


239 


10.939 


34.458 


-16.431 


1.00 


23 14 

+*J . X*T 


297 . 


CB 


PHE 


A 


239 


10.004 


35.612 


-16.083 


1 00 


95 OS 


298 


CG 


PHE 


A 


239 


10.111 


36.768 


-17 022 


i on 


96 ^1 


299 


CD1 


PHE 


A 


239 


9.483 


36.729 


-18.262 


1 00 

X »w 


77 99 


300 


CD2 


PHE 


A 


239 


10.874 


37,881 


-16.691 


1.00 


27 41 


301 


CE1 


r PHE 


A 


239 


9 615 


37.784 


-19 162 


i no 




302 


CE2 


PHE 


A 


239 


11 014 

X X • V/ JL"T 


38 940 


-17 583 


i on 


98 0^ 


303 


CZ 


PHE 


A 


239 


10 382 


JO.U71 


-.18 893 
— X o . o^._j 


i oo 


90 1 f\ 


304 


C r 


; . PHE 


A 


239 


12 370 


34 872 

-7*T.O / X. 




i no 


99 55 


.305 


O 


PHE. 


A 


239 


12 723 




-14 97/5 


i on 
i .1/1/ 


17.0J 


306 


N 


r VAL 


* A 


240 


13 189 


35 002 


-17 171 

X / .X / X 


1 00 

X m\J\J 


22 23 


307 


CA 


: VAL 


A 


240 


14 581 


35 377 


-1 6 974 


1 00 


91 89 


308 


CB 


VAL 


A 


240 


15.492 


34 735 


-18 051 


1 00 


23 08 


309 


CGI 


..; VAL 


A 


- 240 


16.919 


35.265 


-17.916 


1.00 


22.71 


310 


CG2 


VAL 


A 


240 


15.487 


33.217 


-17.889 


1.00 


22.78 


311 


C 


VAL 


A 


240 


14.813 


36.884 


-16.959 


1.00 


22.84 


312 


O 


VAL 


A 


240 


14.387 


37.613 


-17.861 


1.00 


22.43 


313 


N 


ELE 


A 


241 


15.482 


37.340 


-15.905 


1.00 


20.77 


314 


CA 


CLE 


A 


241 


15.817 


38.745 


-15.740 


1.00 


20.91 


315 


CB 


ELE 


A 


241 


15.496 


39.223 


-14.306 


1.00 


20.98 


| 316 


CG2 


ELE 


A 


241 


15.897 


40.687 


-14.137 


1.00 


19.06 


317 


CGI 


ELE 


A 


241 


13.999 


39.039 


-14.028 


1.00 


19.24 


318 


CD1 


ILE 


A 


241 


13.607 


39.288 


-12.574 


1.00 


21.68 


319 


C 


ELE 


A 


241 


17.317 


38.844 


-16.006 


1.00 


21.63 


320 


o 


ELE 


A 


241 


18.137 


38.438 


-15.172 


1.00 


19.53 


321 


N 


HIS 


A 


242 


17.673 


39.362 


-17.181 


1.00 


20.99 


322 


CA 


HIS 


A 


242 


19.077 


39.476 


-17.568 


1.00 


22.26 


323 


CB 


HIS 


A 


242 


19.355 


38.567 


-18.769 


1.00 


24.28 
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HIS 


A 


242 


18.588 


38.937 


-19.998 


1.00. 


24.97 


lie 




i fro 

HIS. 


A 


242 


17.459 


38.415 


-20.534 


1.00 


25.77 


326 


NDl 


HIS 


A 


242 


18.951 


39.986; 


-20.816 . 


1.00 


27.65 


327 


CE1 


HIS 


A 


242 


18.077 


40.096 


-21.800. 


LOO, 


25.99 


328 


NE2 " 


HIS 


A 


242 


17 A 62 . 


39.156 


-21.653 


1.00 


26.18 


329 


C 


HIS 


A 


242 


19.537 


40.902 


. -17.868 


1.00. 


23.10 


330 


O 


HIS 


A 


242 


20.721 


41.135; 


-18.111 . 


1.00 


21.51 


331 


N 


ASP 


A 


243 


18.600 


41.847 


-17.848 


1.00 


23.64 


332 


C A 


ASP 


A 


243 


18.910 


43.255^ 


-18.074 


1.00. 


26.61 


:333 


CB 


ASP 


A 


243 


19.170 


43.546 : 


-19.561 


1.00 


27.87 


334 


CG 


ASP 


A 


243 


17.972 


43.258; 


-20.445 


1.00 


29.61 


335 


; ODl 


ASP 


A 


243 


16.851 


43.097 . 


. -19.921 


1.00. 


28.47 


;336 


- OD2 


ASP. 


A 


243 


18.157 


43.206 


-21.683 


1.00 


31.76 


337 


c * 


ASP 


A 


243 


17.792 


44.150 


-17.553 


,1.00 


27.23 


338 ; 


i o 


ASP. 


A 


243 


16.800 


43.668 


-17.006 


1.00... 


25.64 


339 


N 


MET 


' A 


244 


17-953 


45.457 


-17.727 


.11.00 


28.20 


340 


CA 


MET 


A 


244 


16.964 _ 


46.4JA, 


-17.243 


1.00 


28.88 


341 


CB 


MET 


A 


244 


17.466 


47.838 


-17.483, 


1.00 


31.16 


' 342 


CG 


MET 


A 


244 


18.749 


.48.140, 


-16.733 


.1.00 


33.46 


343 


SD 


MET 


A 


244 


18.511 


48.081 


-14.932 


1.00 


36.77 


344 


CE 


MET 


A 


244 


19.258 


49.667 


-14.476 


\.oo 


37.18 


345 


C 


MET 


A 


244 


15.571 


46241 


-17.837 


1.00 


28.34 


346 


o 


MET 


A 


244 


14.567 


46.411 


-17.138 


1.00 


27.10 


347 


N 


GLU 


A 


245 


15.500 


45.895 


-19.118 


1.00 


28.20 


348 


CA 


GLU 


A 


245 


14.206 


45.709 


-19.760 


1.00 


27.38 


349 


CB 


GLU 


A 


245 


14.375 


45.546 


-21.274 


1.00 


30.74 






/"IT TT 


A 

A 


Zhd 


1 1 nil 


A< O/K 


-zi.yyy 


l.UU 


1A AC 

34.40 


351 


CD 


GLU 


A 


245 


13.226 


45.229 


-23.510 


1.00 


37.18 


352 


OE1 


GLU 


A 


245 


14.109 


44.505 


-24,016 


1.00 


38.99 


353 


OE2 


GLU 


A 


245 


12.455 


45.936 


-24.191 


1.00 


39.82 


354 


C 


GLU 


A 


245 


13.452 


44.508 


-19.189 


1.00 


26.77 


355 


O 


GLU 


A 


245 


12.274 


44.615 


-18.844 


1.00 


24.55 


356 


N 


THR 


A 


246 


14.126 


43366 


-19.086 


1.00 


25.14 
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357 


CA 


THR 


A 


246 


13.485 


42.169 


-18.550 


1.00 


22.92 


358 


CB 


THR 


A 


246 


14.342 


40.909 


-18.802 


1.00 


22.22 


359 


OG1 


THR 


A 


246 


15.686 


41.135 


-18362 


1.00 


20.92 


360 


CG2 


THR 


A 


246 


14.352 


40.571 


-20.288 


1.00 


22.08 


361 


C 


THR 


A 


246 


13.172 


42.299 


-17.058 


1.00 


22.64 


362. 


. O 


TUR 


A 


246 


12.284 


41.615 


-16.545 


1.00 


21 75 


363 


N 


LEU 


A 


247 


13.896 


43.171 


-16360 


1.00 


21.77 


364 


CA 


LEU 


A 


247 


13.630 


43386 


-14.936 


1.00 


20 25 


365 


CB 


LEU 


\ A 


247 


14.698 


44.278 


-14.289 


1.00 


20.20 


366 


CG 


LEU 


A 


247 


14.339 


44.724 


-12.861 


1.00 


18.26 


.367 , 


CD1 


LEU 


A 


247 


14.285 


43.498 


-11.941 


1.00 


19.55 


368 


CD2 


LEU 


, A 


247 


15.367- 


45.723 


-12342 


" 1.00 . 


17.68 


369 


C 


LEU 


A 


247 


12.280 


44.081 - 


--14.833 


1.00 


20.44 


,370 


O 


LEU - 


.. A 


247 


11.417 


43.678 


-14.064 


1.00 


19.24 


371 m 


. . N 


.. :r CYS 


. A 


248 


12.094 


45.126 


-45.630 


1.00 


20.05 


. 372 _ 


i . CA 


CYS 


A 


248 


10.833 


45.856 


-15.606 


1.00 


22.13 


373. _ 


CB 


CYS : 


A 


248 


10.904 


47.076 


-16.536 


1.00 


22.88 


374 . 


SG 


CYS 


A - 

v * » • : 


248 


12.090 


48.355 


-16.013 


1.00 


28.89 


375 


C 


CYS 


. T A 


248 


9.681 


44.939 


-16.016 


1.00 


22.46 


376 


o 


CYS 


. A 


248 


8.594 


45.020 


-15.452 


1.00 


23.21 


377 


N 


- MET 


A 


249 


9.921 


44.067 


-16.994 


1.00 


24.21 


378. 


CA 


MET 


A - 


249 


8.889 


43.139 


-17.465 


1.00 


25.70 


379 


CB 


MET 


A 


249 


9.392 


42.333 


-18.668 


1.00 


28.67 


380 


CG 


MET 


A . 


249 


9.874 


43.175 


-19.847 


1.00 


34.04 


381 


SD 


MET 


A . 


249 


10.480 


42.192 


-21 .255 


1.00 


38.98 


382 


CE 


MET 


A 


249 


10.112 


43318 


-22.621 


1.00 


38.27 


383 


C 


MET 


A 


249 


8.504 


42.178 


-16.342 


1.00 


25.87 


384 


O 


MET 


A 


249 


7.323 


41.924 


-16.098 


1.00 


25.43 


385 


N 


ALA 


A 


250 


9.511 


41.646 


-15.657 


1.00 


24.88 


386 


CA 


ALA 


A 


250 


9.268 


40.720 


-14.559 


1.00 


23.76 


387 


CB 


ALA 


A . 


250 


10.590 


40.188 


-14.022 


1.00 


22.98 


388 


C 


ALA 


. A 


250 


8.482 


41.406 


-13.446 


1.00 


22.99 


389 


O 


ALA 


- A 


250 


7.546 


40.829 


-12.889 


1.00 


23.42 
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In 


GLU 


- A 


251 


8.863 


42.639 


-13.121 


1.00 


22.90 


35/1 


CA 


GLU 


A 


251 


8.176 


43387 


-12.075 


1.00 


.2330 


392 


CB 


GLU. 


*' A 


251 


8.840 


44.756 


-11.865 


i.1-00 . 


23.04 


393 


CG 


GLU 


A 


251 


10.269 


44.676 


"J 1 - 33 . 6 . 


1^00 


23.77 


394 ■ 


CD 


GLU 


A 


251 


10.969 


46.025 


-11.283 


1.00 


2532 


395 


OE1 


GLU 


A 


251 


I 10.914 


46.766 


-12.288 


1.00 


26.09 


396 


OE2 


GLU 


A 


251 


11.590 


46.341 


-10.247 


.100 


24.65 


397 


! c 


GLU 


A 


251 


6.713 


43.572 


-12.465 


1.00 


24.60 


398 


o 


GLU 


A 


251 


5.815 

- —i- 


43.401 


-11.646 


.100. 


23.99 


* 399 




LYS 


A 


252 


6.482 


43.911 


-13.728 


1.00 


26.88 


400 


CA 


LYS 




252 


5.129 


44.120 


-14.227 


} QP 


.2938 


401 


CB 


LYS 


A 


252 


5.163 


44.406 


-15.729 


;1.00 


31.11 


402 


CG 


LYS 


A ; 


252 


3.792 


44.688 


-16.321 


1.00 


34.55 


403 ~ 


CD 


: LYS 


A; 


252 


3-707 


46.086 


-16.917 


1.00 


38.59 


404 


: CE : 


LYS 


A 

m \ 


252 


' 4.142 


47.146 


-15.921 




40.59 


405 


. NZ - 


LYS 


' A: • 


252 


5.625 


47.202 


-15.814 


1.00 


42.89 


406 


C 


"LYS 


At • 


252 


4.229 


42.912 


-13.965 


100 


30.43 


407 


o 


: LYS 


■ Ai ' ■; 


252 


3.045 


43.059 


-13.656 


1.00 


30.61 


408 


N : 

■i ■ • 


THR 


A; 


253 


4.797 


41.717 


-14.083 


1.00 


31.64 


-409 


j CA 


; THR 


A 


253 


4.035 


40.492 


-13.879 


1.00 


32.66 


410 


CB 


THR 


A ; 


253 


4.508 


39.385 


-14.840 


1.00 


34.32 


411 


OG1 


THR 


A 


253 


4.522 


39.884 


-16.185 


1.00 


37.49 


412 


CG2 


THR 


A 


253 


3.564 


38.191 


-14.767 


1.00 


34.76 


413 


C 


THR 


A 


253 


4.076 


39.923 


-12.459 


1.00 


32.57 


414 


O 


THR 


A 


253 


3.033 


39.621 


-11.876 


1.00 


33.27 


415 


N 


LEU 


A 


254 


5.275 


39.784 


-11.904 


1.00 


31.54 


*f ID 




• T "CTT 


A 


254 


C A A 1 

5.441 


Jy.20o 


-10.3 /2 


1.00- 


11 OA 

31. o4 


417 


CB 


LEU 


A 


254 


6.805 


38.522 


-10.489 


1.00 


32.07 


418 


; CG 


LEU 


A 


254 


6.933 


37.131 


-11.126 


1.00 


33.07 


419 


CD1 


LEU 


A 


254 


5.781 


36.849 


-12.083 


1.00 


32.88 


420 


CD2 


LEU 


A 


254 


8.271 


37.040 


-11.833 


1.00 


32.53 


421 


C 


' LEU 


A 


254 


5.256 


40.128 


-9.367 


1.00 


32.52 


422 


O 


LEU 


A 


254 


4.770 


39.690 


-8321 


1.00 


31.50 
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423 


N 


VAL 


A. 


255 


5.654 


41.389 


-9.501 


1.00 


32.08 


. 424 - 


CA 


VAL 


- A 


255 


5.514 


42.353 


-8.413 


1.00 


32.49 


425 


CB 


VAL - 


A 


255 


6.870 


42.614 


-7.710 


1.00 


32.30 


426. 


CQ1 


| VAL 


A 


255 


6.698 


43.659 


-6.620 


1.00 


33.97 


427 


CG2 


, VAL • 


A 


255 


7.393 


41.322 


-7.095 


1.00 


33.63 


428 


C 


VAL 


A 


255 


4.988 


43.650 


-9.017 


1.00 


11 Sfi 

J J.JO 


429 


O 


VAL 


A 


255 


5.664 


44.680 


-9.018 


1.00 


17 17 


430 


N 


ALA 


A 


256 


3.766 


43.571 


-9.532 


1.00 


1A <37 


431 


CA 


ALA 


A 


256 


3.092 


44.685 
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1 R 7 AO 
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HO 
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11 141 


1 5 Q65 


1 OO 
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2548 


O 




q 
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7R HQQ 

Z0.U77 


54 140 
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ft 01 1 


1 oo 
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XXV/Xl 


' . q 
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6 66Q 


90 044 
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1 00 


66 41 
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9550 


r\ 

kj ■> 




q 
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O 1R1 


51 495 
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1 OO 
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9551 




JliJxl 


C 
o 
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OA MA 
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1 OO 


44 71 
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rs 
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rlUxi 
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o 
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1 OO 
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9551 


KJ 


JtlUxl 


q 
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97 414 


71 147 


90 041 


1 00 


46 70 
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r\ 

KJ 


• TTfYPT 
xlUrl 


q 
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9 179 


10 01 R 
J7.U16 


O 509 


1 00 
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2555 


» KJ 


rxKJtt 


q 
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1 0 676 • 


14 791 


1 1 054 


7 00 


54 77 


2556 


n 


rxKJxx 


q 


19R 


11 515 




1 1 798 

4 1./ X.O 


1 00 


18 27 


2557 


o 


HOH 


q 


19Q 


10 971 


41 414 


19 468 


1.00 


56.22 


2558 


o 


HOH 


S 


130 

•J -J V/ 


31.821 


6.860 


14.047 


1.00 


49.62 


2559 


o 


HOH 


s 


331 


34.176 


36.382 


-11.484 


1.00 


54.81 


2560 


o 


HOH 


s 


332 


16.341 


29.905 


-0.825 


1.00 


50.86 


2561 


o 


HOH i 


s 


333 


16.034 


14.050 


-0.096 


1.00 


26.81 


2562 


o 


HOH 


s 


334 


35.569 


39.304 


6.909 


1.00 


58.70 


2563 


o 


HOH 


s 


335 


7.222 


18.767 


-13.380 


1.00 


48.09 


2564 


o 


HOH 


s 


336 


4.343 


25.758 


17.613 


1.00 


48.32 


2565 


o 


HOH 


s 


337 


23.980 


49.898 


20.317 


1.00 


45.46 


2566 


o 


HOH 


s 


338 


-1.308 


39.173 


20.570 


1.00 


47.06 


2567 


o 


HOH 


s 


339 


29.401 


35.057 


-13.568 


1.00 


51.53 
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2568 


O 


HOH 


s 


340 


22.171 


11 944 


-8.266 


1 00 


58 3f 

JO.JO 


256? 


O ... 


] HOH 


' . s 


341 


28.556 


23.420 


-8.393 


1 00 


38 50 


25.70 


0 


HOH 


•S 


342 


16 039 


7 600 


• 25 055 


1 00 


63 84 


2571 


or 


HOH 


s . 
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34 042 


9 843 


1 00 


^7 04 
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I*f.O f*r 
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A A OO 
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nun 
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l.UU 
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DZ.OJ 
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o 

U 
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o 
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l.UU 
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2575 


n 

u 
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9£ 1 1 Q 
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l.UU 
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7fl 7A7 
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jU.j lo 
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l.UU 
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o 
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74 1 10 
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l.UU 
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3U. f 1 
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1/ coo 
34.D^Z 
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i nn - 

l.UU 


no 

i 
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1- 1 KA 
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l.UU 
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sy *dh 


9580 


u 
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5J 
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ZZ. 1 /Z 
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l.UU 
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u 
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l.UU 
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U 
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• 1 fi <£7 Q 
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A7 
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i n fi^ft 
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l.UU 


iq on 
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u 




Q 
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DDO 


oo 055 


<1 AA7 
J 1 .uo / 


1 1 5270 
-1 1.3 /y 


1 nn 

l.UU 


59 57 


2585 


u - 


nun 


o 
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1 1.01 / 


^1 991 
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1 on* 

l.UU 


6*7 06 


2586 


n 


nun 




75R 


7 OS7 


i o A71 

i y .*+d i 


' 74 400 

-z^.nyy 


i no- 

l.UU 


45 57 


25 R7 


U •■ 


nun 


5! . 


75Q 


77 fiQfi 
ZZ. Uyo 


ah nil 


1 4^1 


1 nn 

l.UU 


61 45 
01.** D 


2588 


1 u 


nun 


G 


760 


79 A77 


jvi 770 


• 31 096 
D 1 .l/ZO 


1 00 


47 98 


2589 


o 


nun 


c . 


361 

DO I 


30 447 


7H 0^4 


90 877 


1 00 

1 .UU 


57 9S 


2590 


o 


HOH 

XX\^/XX 


s 


362 


35 833 


38 09S 


1 980 


1 00 

X ivw 


61.81 


2591 


0 


HOH 


s 


363 


6 113 


31 803 

J 1 .07J 


-17 272 


1 00 

X .vw 


49.61 


2592 


0 


HOH 


s 


364 


6.724 


15.601 


-22.408 


1.00 


59.42 


2593 


0 


HOH 


s 


365 


12.111 


1.178 


18316 


1.00 


48.35 


2594 


o 


HOH 


s 


366 


4.240 


32.098 


1.799 


1.00 


44.18 


2595 


0 


HOH 


s 


367 


29.724 


35.065 


27.753 


1.00 


55.76 


2596 


o 


HOH 


. s 


368 


26.360 


7.462 


25.028 


1.00 


51.90 


2597 | 


o 


HOH 


s 


369 


9.107 


46.874 


3.890 


1.00 


56.70 


2598 


o 


HOH 


s 


370 


10.650 


46.430 


-20.344 


1.00 


37.21 


2599 


o 


HOH 


s 


371 


8.233 


48.961 


17.197 


1.00 


59.01 


2600 


0 


HOH 


s 


372 


20.322 


15.007 


-8.818 


1.00 


55.36 
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2601 


O 


HOH 


S 


373 


19.260 


-0.126 


16.569 


1.00 


45.30 


2602 


O 


, HOH . 


- S 


374 


7.343 


49.778 


-0.668 


1.00 


55.02 


2603 


O 


HOH 


S 


375 


2.507 


47.867 


0.719 


1.00 


48.12 


. 2604 


O 


HOH 


S 


376 


13.136 


54.591 


-2.145 


1.00 


53.44 


2605 


O . 


- HOH 


. s - - 


377 


17.466 


16.519 


30.208 


1.00 


49.11 


2606 


O 


HOH 


s 


378 


20.118 


10.769 


23.252 


1.00 


55.34 


2607 


O 


HOH 


s 


379 


23.542 


12.685 


26.089 


1.00 


62.62 


2608 


O 


HOH 


s 


380 


20.375 


47.120 


12.159 


1.00 


51.92 


2609 


O 


HOH 


s , 


381 


9.032 


10.924 


17.726 


1.00 


57.21 


2610 


o 


HOH 


s 


382 


15.077 


15.226 


30.470 


1.00 


56.74 


2611 


o 


HOH 


s 


383 


27.823 


7.313 


-8.539 


1.00 


49.75 


2612 


o 


HOH 


s 


384 


-1.210 


31.371 


18.699 


1.00 


53.14 


2613 


o 


HOH 


s 


385 


7.322 


35.525 


28.034 


1.00 


61.44 


2614 


o 


HOH 


s 


386 


14.217 


16.674 


-13.221 


1.00 


42.98 


2615 


O ; 


pHOH 


s . . .. 


387 


20.078 


59.930 


-2.925 


1.00 


54.66 


2616 


o 


HOH 


s 


388 


28.936 


2.000 


15.009 


1.00 


47.04 


2617 


, o 


HOH 


s 


389 


14.166 


28.692 


31.577 


1.00 


59.65 


2618 


o 


HOH 


s 


390 


20.846 


52.499 


14.026 


1.00 


61.68 


2619 


o 


HOH 


s 


391 


13.294 


5.368 


25.444 


1.00 


52.09 


2620 


o 


HOH 


s 


392 


26.874 


59.625 


-2.846 


1.00 


46.54 


2621 


9 


HOH 


s 


393 


11.393 


5392 


16.760 


1.00 


45.29 


2622 


o 


HOH 


s 


394 


30.608 


57.650 


11.108 


1.00 


63.95 


2623 


o 


HOH 


s 


395 


33.858 


39.183 


-13.250 


1.00 


5428 


2624 


o 


HOH 


s 


396 


16.117 


14.396 


-7.011 


1.00 


46.14 


2625 


o 


HOH 


s 


397 


37.725 


38.263 


6.165 


1.00 


64.28 


2626 


o 


HOH 


s 


398 


6.555 


42.050 


-19.474 


1.00 


48.56 


2627 


CIA 


735 


c 


1 


19.341 


40.726 


3.997 


1.00 


15.91 


2628 


OIC 


735 


c 




18.234 


40.214 


4328 


1.00 


17.26 


2629 


OIB 


735 


c 




20.387 


40.473 


4.647 


1.00 


17.17 


2630 


C1D 


735 


c 




19.458 


41.690 


2.760 


1.00 


14.73 


2631 


C1X 


735 


c 




19.838 


43.096 


3.263 


1.00 


16.66 


2632 


C1Y 


735 


c 




18.087 


41.870 


1.985 


1.00 


16.99 


2633 


OlE 


735 


c 




20.617 


41.213 


1.907 


1.00 


15.62 
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1 
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C1J 
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2644 


5 : S2P - 


755 

133 
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1 00 
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1 /.l JO 
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1 .uu 


1 R 7£ 
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73 5 
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17 7Q3 
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Jo. HOD 
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1 00 
l.UU 


10 7fl 
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3R ^SR 
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•>J9*E 
: INaJB 




p 




1 6 0751 


ao OIQ 
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■ -O.Ja**\ 


1 oo 

. l.UU 


17 Kft 

.1 / .Oo 


2640 
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p 




15 fvl 4 


3R 7R0 
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. -2.yH3 


1 00 
1 .uu 
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2650 


r*2W 
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30 1R6 
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1 00 
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2651 
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/J J 


p 
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50 004 
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1 00 
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2652 
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p 




19 0R4 


30 400 


-7 096 


1.00 


22.73 


2653 


C2M 


735 


c ... 


J 


12.305 


39.993 


-8.391, 


1.00 


20.47 


2654 


C2K 


735. 


c . 




13.651 


40.175 


-8.842 


1.00, 


23.90 


2655 


£21 


... 735. 


c 




14.735 


39.782 


-8.042 


1.00 


22.13 


2656 


. ,C2N 


735 


c 




11.115 


40.435 


-9.308 


1.00 


25.97 


2657 


F2P 


735 . 


c 




10.952 


39.499 


-10.296 


1.00 


31.88 


2658 


F2Q 


-735 .- 


c 




9.934, 


40.532 


-8.627 


1.00 


31.88 


2659 


F20 


735 


c 




11.362 


41.644 


-9.892 


1.00 


31.88 
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TABLE 3 

ATOMIC STRUCTURE COORDINATE DATA OBTAINED FROM X-RAY 
DIFFRACTION FROM THE LIGAND BINDING DOMAIN OF PPARq USED IN 
MOLECULAR REPLACEMENT 



ATOM 


ATOM 
TYPE 


■RRSTDT TF 

IX CO 1 1 ' Ul> 


X Xvv_/ X XZJLLN 
# 

TT 


ff 


v 

Xk, 


V 
X 


T 

JL, 




Jo 


1 


CB 


ASP 


A 


91 1 

Zl 1 


14^1 
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1 O QA O 


1.00 
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2 


CG 
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A 

/V 
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Zl 1 
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A no^ 
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ZO.Ooo 


1 Art 

1.00 


*7rt a 

79.33 


3 




ASP 


A - 


911 
Zl 1 


IjJOj 


J.J J3 




1 Art 

1.00 


OA Ol 

oO.ZJ 


4 


OD2 


ASP 


A 


91 1 
Zl 1 


i4.yoi 


'X 409 


9A Oflff 


1 AA 
l.OO 


OA 1.4 


5 


C 


ASP 

• /lux 


A 


91 1 
Zl 1 


1 ^ 941 
13.Z*tl 


7 37^ 
/.J /3 


1S.Z/Z 


1 AA 
l.OO 


/O.JJ 


6 


O 


ASP 
nor 


A 


91 1 
Zll 


14 371 


R 1 7£ 
o.l /O 


17 Q90 


1 nn 
l.OO. 


/O.30 


7 


N 


ASP ; 


A 
XV 


91 1 
Zll 


1#S 1AA- 


3.UOO 


- 1 C AO O 

lo.OZy 


1 AA 

1 .OO ; 


/ / .JO 


8 


PA 


ASP 
X\jjx 


A 


91 1 
Zl 1 


1 14G93 
. 1*KJJZJ 


< SCO 


1 O Af\Q 


l.OO 


*7T AO 

/ /.Oo 


9 


"M 
atn 


T PTT 


A 

XV . 


919 
Z1Z 


1 A AO 


7 7*35 


1 Q <AQ 


1 AA 
l.OO 


*7/I OA 


10 


CA 




A 


919 


1 QOfL 
lOCXO 


Q 19<? 
- y*lZ3 


1 R 41 51 
1 o.*f JO 


1 AA 


79 O/i 


11 


wJJ 


T PTT 


A 

X\. 


919 
Z1Z 


1 8 149 


Jf.JJO 


1 O 19Q 
li/.JZo 


1 AA 
- 1 .OO 


73 

/J.JO 


12 


CG 


LEU 


A 


919 


IS ROQ 


1U. / VJ 


1Q OQS 


1 ftft 




13 


CD1 


LEU 




919 


IS 094 


11 83 


10 007 


1 no. 




14 


CD2 


LEU 


A 


212 


20.274 


10.68 


19.492 


1.00 


73.65 


15 


C 


LEU 


A 


212 


17.315 


9.398 


16.997. 


1.00 


71.55 


16 


o 


LEU 


A 


212 


17.304 


10.54 


16.548 


LOO 


71.54 


17 


N 


LYS 


A 


213 


17.672 


8.346 


16.274 . 


1.00 


69.51 


18 


CA 


LYS 


A 


213 


18.069 


8.512 


14.89 


1.00 


67.63 


19 


CB 


LYS | 


A 


213 


18.837 


7.279 


14.416 


1.00 


68.39 


20 


CG 


LYS 


A 


213 


19.477 


7.423 


13.047 


1.00 


68.94 


21 


CD 


LYS 


A 


213 


20.908 


6.9 


13.051 


1.00 


69.41 


22 


CE 


LYS 


A 


213 


21.036 


5.602 


13.837 


1.00 


69.58 


23 


NZ 


LYS 


A 


213 


22.361 


4.97 


13.652 


1.00 


7030 


24 


C 


LYS 


A 


213 


16.845 


8.769 


14.018 


1.00 


6551 


25 


o 


LYS 


A 


213 


16.97 


9.16 


12.856 


1.00 


66.04 


26 


N 


SER 


A 


214 


15.661 


8.558 


14.587 


1.00 


62.82 


27 


CA 


SER 


A 


214 


14.425 


8.817 


13.86 


1.00 


59.48 
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28 


CB , 


SER 


A 


214 


13.216 


8267 


14.624 


1.00 


59.65 


29, - 


OG 


SER 


A - 


214 


12.983 




15.815 


1.00 


59.47 


30 


C 


SER 


A - 


214 


14.326 


10.34 


13.76 


1.00 


56.82 


. 31*. 


; O 


, SER . 


A 


214 


13.501 


10.87 


13.019 


1.00 


56.27 


?- 32 r- 


N • : 


LEU 


A '" 


215 


15.183 


11.01 


14.524 


LOO 


53.62 


■33.- 


CA- : 


LEU 


A. - 


215 


15.244 


12.47 


14.553 


1.00 


50 r 75 


34 


CB 


LEU , 


A • 


215 


' 16.368 


12.93 


15.49 


1.00 


50 ; 59 


35 


CG 


LEU 


A 


215 


16.736 


14.42 


15.548 


1.00 


49.94 


"'36 


CD1 


LEU 


A 


215 


15.58 


15.23 


16.111 


1.00 


49: 18 


370< . 


CD2 


•;■ LEU i 


< A ■■■■ 


215 


17.975 


14.6 


16.414 


1.00 


49.63 


38-" 


c 


• LEU • 


■■ A; -■ 


215 


15.496 


13.02 


13.155 


1.00 


48.97 


-39 " 


o > 


l LEU 


i <■ • a : 


-215 


15.029 


^ 14.1 


: 12.809 


1.00 


49.11 


40H, 


N 


ALA , 


' A - 


216 


16.238 


12.26 


12.355 


1.00 


46.47 


41 • 


; CA 


ALA 


•- ' A : ■ 


216 
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It will be understood that various details of the invention may be changed without 
departing from the scope of the invention. Furthermore, the foregoing description is for the 
purpose of illustration only, and not for the purpose of limitation — the invention being 
defined by the claims. 
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- •-: - }.> t * : • CLAIMS' -< ; ■■ -T •'■ <>- . =vt: * ^ : " * 
What is claimed is: j 

1. A substantially pure PPARa ligand binding domain polypeptide 
..in crystalline form;. ; i •. : : - : 

2. The polypeptide of claim 1, wherein the crystalline form has 
lattice constants of a = 61.3 A, b = 103.5 A, c = 49.9 A, a = 90°, p = 90°, y = 

3. The polypeptide of claim 1 or 2, wherein the crystalline temi is 
an orthorhombic crystalline form. 

4. The polypeptide of claim 1 or 2, wherein I the crystalHhe^oiTn has 
a space group of P2i2i2. 

5. The polypeptide of claim 1 or 2, Wherein 5 the - PPARa ligand 
binding domain polypeptide has the amino acid sequence shown in SEQ ID 

NO: 4. or: ... • , ' •"" . 

6. The polypeptide of claim 1 or 2, wherein the PPARa ligand 
binding domain polypeptide is in complex with a ligand. 

7. The polypeptide of claim 1 or 2, wherein the PPARa ligand 
binding domain has a crystalline structure further characterized by the 
coordinates corresponding to Table 2. 

'<8. The polypeptide of claim 1 or 2, wherein the crystalline form 
contains one PPARa ligand binding domain polypeptide in the asymmetric 
unit. 

9. The polypeptide of claim 1 or 2, wherein the crystalline form is 
such that the three-dimensional structure of the crystallized PPARa ligand 
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binding domain polypeptide can be determined to a resolution of about 1.8 A 
or better. 

10. The polypeptide of claim 1 or 2, wherein the crystalline form 
contains one or more atoms having a molecular weight of 40 grams/mol or 
greater. 

11. The polypeptide of claim 1, wherein the crystalline form has 
lattice constants of a = 95.58 A, b = 122.06 A, c = 122.10 A, a = 90°, (3 = 90°, 
Y = 90°_ . . . . . r s - . , 

12. The polypeptide of claim 1 or 11, wherein the crystalline form is 
an orthorhombic crystalline form. , a > 

13. The polypeptide of claim 1 or 11, wherein the crystalline form 
has a space group ofP2|2i2i. . ; <: , 

14. The polypeptide of claim 1 or 11, wherein the PPARa ligand 
binding domain polypeptide has the amino acid sequence shown in SEQ ID 

NO: 4. v ..... 1|;>i , : - 

15. The polypeptide of claim 1 or 11, wherein the crystalline form 
contains four PPARa ligand binding domain polypeptides in the asymmetric 
unit. r 

16. The polypeptide of claim 1 or 11, wherein the crystalline form is 
such that the three-dimensional structure of the crystallized PPARa ligand 
binding domain polypeptide can be determined to a resolution of about 2.5 A 
or better. 
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- 17. c The polypeptide of claim 1 or 11, wherein the crystalline form 
contains one or more : atoms -having a molecular weight of 40 grams/mol or 
greater. . . , ....... \- : * 

18. A method for determining .the three-dimensionial structure of a 
crystallized PPARa ligand binding domain polypeptide to a resolution of about 
1.8 A or better, the method comprising: 

(a) crystallizing a PPARa ligand binding domain polypeptide; and 

(b) analyzing the PPARa ligand binding domain polypeptide to 
determine the three-dimensional structure of the crystallized 
PPARa ligand binding domain polypeptide, whereby the three- 
dimensional structure^ of a crystallized; PPARa ligand binding 
domain polypeptide is- determined to a resolution of about 1 .8 A 
or better, ••/••vv. r - 

19. The method of claim 18/ wherein the analyzing is by X-ray 
diffraction. : , , : . ^ ? : v, • ; , 

20. The method of claim 18, wherein the crystallization is 
accomplished by the hanging drop vapor diffusion method, and wherein the 
PPARa ligand binding domain is mixed with an equal volume of reservoir. 

21. The method of claim 20, wherein the reservoir comprises 4-8% 
PEG 3350, 100-200mM NaF, and 12-16% 2,5 hexanediol. 

22. The method of claim 20, wherein the reservoir comprises 50 mM 
bis-tris-propane, 4-6% PEG 3350, 150 mM NaN0 3 , 16% 2,5 hexanediol and 
1-3 mM YCI. - 

..; 23. A method of generating a crystallized PPARa ligand binding 
domain polypeptide, the method comprising: 
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(a) incubating a solution comprising a PPARa ligand binding 
domain with an equal volume of reservoir; and 

(b) crystallizing the PPARa ligand binding domain polypeptide using 
the hanging drop method, whereby a crystallized PPARa ligand 
binding domain polypeptide is generated. 

24. A crystallized PPARa ligand binding domain polypeptide 
produced by the method of claim 23. 

25. A method of designing a modulator of a PPAR polypeptide, the 
method comprising: \ v rv r , 

(a) designing a potential modulator of a PPAR polypeptide that will 
. make interactions with amino acids In the ligand binding site 

based upon a crystalline structure of a PPARa ligand binding 
domain polypeptide; 

(b) synthesizing the modulator; and 

(c) determining whether the potential modulator modulates the 
activity of the PPAR polypeptide, whereby a modulator of a 

- PPAR polypeptide is designed. 

26. A method of designing a modulator that selectively modulates 
the activity of a PPAR polypeptide the method comprising: 

(a) obtaining a crystalline form of a PPARa ligand binding domain 
polypeptide; , * 

(b) evaluating the three-dimensional structure of the crystallized 
PPARa ligand binding domain polypeptide; and 

(c) synthesizing a potential modulator based on the three- 
dimensional crystal structure of the crystallized PPARa ligand 
binding domain polypeptide, whereby a modulator that 
selectively modulates the activity of a PPARa polypeptide is 
designed. 
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k 27. The method of claim 26, wherein the nriethod further comprises 
contacting a PPARa ligand binding domain polypeptide with . the potential 
modulator; and assaying the PPARa ligand binding domain polypeptide for 
binding of the potential modulator, for a change in activity of the PPARa ligand 
binding domain polypeptide, or both. r : ; - '\t. { &-»? 



28. The method of claim 26, wherein the crystalline form is in 
orthorhombic form. 

-29. The method of claim 28, wherein the crystalline form is such that 
the three-dimensional structure of th£ ^ crystallized s PPARd ligand binding 
domain polypeptide can be determined to a resolution of about 1.8 A or better. 

30. A method of screening a plurality 

of a PPAR ligand binding domain polypeptide, the method comprising: 

(a) - providing a library of test samples; ' } 

(b) contacting a crystalline 1 PPARa ligand binding domain 
polypeptide with each te4t sample; 

(c) detecting an interaction between a test 5 sample and the 
crystalline PPARd ligand binding domain polypeptide; 

(d) 1 identifying a test sample that Interacts with the crystalline 

PPARa ligand binding domain polypeptide; and 

(e) isolating a test sample that interacts with the crystalline PPARa 
ligand binding domain polypeptide, whereby a plurality of 
compounds is screened for a modulator of a PPAR ligand 
binding domain polypeptide. 

31 . The method of claim 30, wherein the test samples are bound to 
a substrate. 

32. The method of claim 30, wherein the test samples are 
synthesized directly on a substrate. 
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33. A method for identifying a PPAR modulator, the method 
comprising: r c 4 . 

(a) providing atomic coordinates of a PPARa iigand binding domain 
to a computerized, modeling systemjand - ■ - 

(b) modeling ligands that fit spatially jnto. the binding pocket of the 
PPARa Iigand binding domain to thereby identify a PPAR 
modulator, whereby a PPAR modulator is identified. 

34. The method of claim 33, wherein the method further comprises 
identifying in an assay for PPAR-mediated activity a modeled Iigand which 
increases or decreases the activity of the PPAR. 

35. A method of identifying a PPARa modulator that selectively 
modulates the activity, of, a PPARa - polypeptide , compared to other 
polypeptides, the method comprising: •-.-.. ;r 

(a) providing atomic coordinates of a PPARa Iigand binding domain 
to a computerized modeling system; and 

(b) modeling a Iigand that fits into the bindjng pocket of a PPARa 
Iigand binding domain and that interacts with conformational^ 
constrained residues of a PPARa conserved among PPAR 
subtypes, whereby a PPARa modulator that selectively 
modulates the activity of a PPARa polypeptide compared to 
other polypeptides. 

36. The method of claim 35, wherein the method further comprises 
identifying in a biological assay for PPARa activity a modeled Iigand that 
selectively binds to said PPARa and increases or decreases the activity of 
said PPARa. 

37. A method of designing a modulator of a PPAR polypeptide, the 
method comprising: 

(a) selecting a candidate PPAR Iigand; 
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...... (b) determining which amino acid "or -amino acids of a PPAR 

:;. polypeptide interact. with^the- ligand fusing a^three-dimensional 
model of a crystallized protein comprising a PPARa LBD; 

(c) identifying in a biological assay for "PPAR activity a degree to 
- - which the ligand modulates the activity of the PPAR polypeptide; 

(d) selecting a chemical modification of the ligand wherein the 
interaction between the amino acids of the PPAR polypeptide 
and the ligand is predicted to be modulated by the chemical 
modification; /:< - : 

(e) performing the chemical modification on the ; ligand to form a 
modified ligand; * ^ r v ^ ■ ■' < r " 

\ (f). contacting the modified ligand with the PPAR polypeptide; 

(g) identifying in a biological assay for PPAR activity a degree to 
which the modified ligand modulates the biological activity of the 
PPAR polypeptide; and ~ - erf u A>&>?x> - 

(h) comparing the biological activity of the PPAR polypeptide in the 
presence toft modified ligand with the biological activity of the 
PPAR polypeptide in the presence of the unmodified ligand, 
whereby a modulator of a PPAR polypeptide is designed. 

38. The method of claim 37, wherein the PPAR polypeptide is a 
PPARa polypeptide. 

. 39. The method of claim 37, wherein the three-dimensional model of 
a crystallized protein is a PPARa LBD polypeptide with a bound ligand. 

40. The method of claim 37, wherein the method further comprises 
repeating steps (ai) through (f), if the biological activity of the PPAR 
polypeptide in the presence of the modified ligand varies from the biological 
activity of the PPAR polypeptide in the presence of the unmodified ligand. 
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41 . An assay method for identifying a compound that inhibits binding 
of a ligand to a PPAR polypeptide, the assay method comprising: 

(a) incubating a PPAR polypeptide with a ligand in the presence of 
a test inhibitor compound; 

(b) determining an amount of ligand that is bound to the PPAR 
polypeptide, wherein decreased binding of ligand to the PPAR 
protein in the presence of the test inhibitor compound relative to 
binding of ligand in the absence of the test inhibitor compound is 
indicative of inhibition; and 

(c) identifying the test compound as an inhibitor of ligand binding if 
decreased ligand binding is observed, whereby a compound that 
inhibits binding of a ligand to a PPAR polypeptide is identified. 

42. A method of identifying a PPAR modulator that selectively 
modulates the biological activity of one PPAR subtype . compared to PPARcr, 
the method comprising: 

(a) providing an atomic structure coordinate set describing a 
PPARa ligand binding domain structure and at least one other 
atomic structure coordinate . set describing a PPAR ligand 
binding domain, each ligand binding domain comprising a ligand 
binding site; 

(b) comparing the PPAR atomic structure coordinate sets to identify 
at least one d'rference between the sets; 

(c) designing a candidate ligand predicted to interact with the 
difference of step (b); 

(d) synthesizing the candidate ligand; and 

(e) testing the synthesized candidate ligand for an ability to 
selectively modulate a PPAR subtype as compared to PPARa, 
whereby a PPAR modulator that selectively modulates the 
biological activity of one PPAR subtype compared to PPARa is 
identified. 
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SEQUENCE LISTING 



<110> SmithKline Beecham Corporation 
Xu, Dr. Eric 
Montana, Valerie 
Lambert, Dr. Millard 



<120> Crystallized PPARa Ligand Binding Domain in Complex With a 
Ligand a: 
Activity 



Ligand and Method of Determining and Designing Modulators Of PPAR 



<130> Attorney Docket No. PD4071 



<160> 9 



<170> Patentln version 3.0 



<210> i . ") 



<211> 1731 



<212> DNA 



<213> Homo sapiens 



<220> 
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<300> 

<308> Genbank/S7 434 9 

<309> 1995-04-12 

<313> <1)..(1681) 



<400> 1 

gttctggaggctgggaagtt caagatcaaa gtgccagcag attcagtgtc atgtgaggac 60 
gtgcttcctg cttcatagat aagagcttgg agctcggcgc acaaccagca ccatctggtc 120 



gcg atg gtg gac acg gaa age cca etc tgc ccc etc tec cca etc gag 
Met Val Asp Thr Glu Ser Pro Leu Cys Pro Leu Ser Pro Leu Glu 
1 5 10 . 15 

gee ggc gat eta gag age ccg tta tct gaa gag ttc ctg caa gaa atg 
Ala Gly Asp Leu Glu Ser Pro Leu Ser Glu . Glu Phe Leu Gin Glu . Met 
20 25 30 

gga aac ate caa gag att teg caa tec ate ggc gag gat agt tct gga 
Gly Asn lie Gin Glu lie Ser Gin Ser lie Gly Glu Asp Ser Ser Gly 
35 40 45 

age ttt ggc ttt acg gaa tac cag tat tta gga age tgt cct ggc tea 
Ser Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser 
50 55 60 

gat ggc teg gtc ate acg gac acg ctt tea cca get teg age ccc tec 
Asp Gly Ser Val lie Thr Asp Thr Leu Ser Pro Ala Ser Ser Pro Ser 
65 70 75 

teg gtg act tat cct gtg gtc ccc ggc age gtg gac gag tct ccc agt 



168 



216 



264 



312 



360 



408 
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Ser Val Thr Tyr Pro Val Val Pro Gly Ser Val Asp Glu Ser Pro Ser 
80 85 90 95 

gga gca ttg aac ate gaa tgt aga ate tgc ggg gac aag gec tea ggc 456 
Gly Ala Leu Asn lie Glu Cys Arg lie Cys Gly Asp Lys Ala Ser Gly 
100 105 110 

tat cat tac gga gtc cac gcg tgt gaa ggc tgc aag ggc ttc ttt egg 504 
Tyr His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly Phe Phe Arg 
115 120 125 

cga acg att cga etc aag ctg gtg tat gac aag tgc gac cgc age tgc 552 
Arg Thr He Arg Leu Lys Leu Val Tyr Asp Lys Cys Asp Arg Ser Cys 
130 135 140 

aag ate cag aaa aag aac aga aac aaa tgc cag tat tgt cga ttt cac 600 ) 

Lys He Gin Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys Arg Phe His 
145 150 155 

aag tgc ctt tct gtc ggg atg tea cac aac gcg. att cgt ttt gga cga . - .648 
Lys Cys Leu Ser Val Gly Met Ser His Asn Ala He Arg Phe Gly Arg 
160 165 170 175 

atg cca aga tct gag aaa gca aaa ctg aaa gca gaa att ctt ace tgt 696 
Met Pro Arg Ser Glu Lys Ala Lys Leu Lys Ala Glu He Leu Thr Cys 
180 185 190 

gaa cat gac ata gaa gat tct gaa act gca gat etc aaa tct ctg gee 744 
Glu His Asp He Glu Asp Ser Glu Thr Ala Asp Leu Lys Ser Leu Ala 
195 200 205 

aag aga ate tac gag gec tac ttg aag aac ttc aac atg aac aag gtc 792 J 

Lys Arg He Tyr Glu Ala Tyr Leu Lys Asn Phe Asn Met Asn 'Lys Val 
210 215 220 

aaa gee egg gtc ate etc tea gga aag gee agt aac aat cca cct ttt 840 
Lys Ala Arg Val He Leu Ser Gly Lys Ala Ser Asn Asn Pro Pro Phe 
225 230 235 

gtc ata cat gat atg gag aca ctg tgt atg get gag aag acg ctg gtg 888 

Val He His Asp Met Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val 

240 245 250 255 

gee aag ctg gtg gec aat ggc ate cag aac aag gag gcg gag gtc cgc 936 

Ala Lys Leu Val Ala Asn Gly He Gin Asn Lys Glu Ala Glu Val Arg 
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ate ttt cac tgc tgc cag tgc acg tea gtg gag acc gtc acg gag etc 984 

lie Phe His Cys Cys Gin Cys Thr Ser Val Glu Thr Val Thr Glu Leu 

275 280 285 

acg gaa ttc gee aag gee ate cca ggc ttc gca aac ttg gac ctg aac * 3.032 

Thr Glu Phe Ala Lys Ala lie Pro Gly Phe Ala Asn Leu Asp Leu Asn 

290 295 * 300 

gat caa gtg aca ttg eta aaa 'tac gga gtt tat gag gee ata ttc gee 1080 

Asp Gin Val Thr Leu Leu Lys Tyr Gly Val Tyr Glu Ala lie Phe Ala 
305 310 315 - 

atg ctg tct tct gtg atg aac aaa gac ggg atg ctg gta gcg tat gga 1128 

Met Leu Ser Ser Val Met Asn Lys Asp Gly Met Leu Val Ala Tyr Gly 

320 325 330 335 

aat ggg ttt ata act cgt gaa ttc eta aaa age eta agg aaa ccg ttc 1176 

Asn Gly Phe He Thr Arg Glu Phe Leu Lys Ser Leu Arg Lys Pro Phe 

340 345 350 

tgt gat ate atg gaa ccc aag ttt gat ttt gec atg aag ttc aat gca 1224 

Cys Asp He Met Glu Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala 

355 360 365 

ctg gaa ctg gat gac agt gat ate tec ctt ttt gtg get get ate att 1272 

Leu Glu Leu Asp Asp Ser Asp He Ser Leu Phe Val Ala Ala He lie 

370 375 380 

tgc tgt gga gat cgt cct ggc ctt eta aac gta gga cac att gaa aaa 1320 

Cys Cys Gly Asp Arg Pro Gly Leu Leu Asn Val Gly His He Glu Lys 
3S5 390 395 

atg cag gag ggt att gta cat gtg etc aga etc cac ctg cag age aac 1368 

Met Gin Glu Gly He Val His Val Leu Arg Leu His Leu Gin Ser Asn 

400 405 410 415 

cac ccg gac gat ate ttt etc ttc cca aaa ctt ctt caa aaa atg gca 1416 

His Pro Asp Asp He Phe Leu Phe Pro Lys Leu Leu Gin Lys Met Ala 

420 425 430 

gac etc egg cag ctg gtg acg gag cat gcg cag ctg gtg cag ate ate 1464 

Asp Leu Arg Gin Leu Val Thr Glu His Ala Gin Leu Val Gin lie He 
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aag aag acg gag teg gat get gcg ctg cac ccg eta ctg cag gag ate 1512 
Lys Lys Thr Glu Ser Asp Ala Ala Leu His Pro Leu Leu Gin Glu lie 

450 455 460 

tac agg gac atg tac tgagttcctt cagatcagcc acaccttttc caggagttct 1567 
Tyr Arg Asp Met Tyr 
465 

gaagctgaca gcactacaaa ggagacgggg gagcagcacg attttgeaca aatatccacc 1627 
actttaacct tagagcttgg acagtctgag ctgtaggtaa ceggcatatt attccatatc 1687 
tttgttttaa ccagtacttc taagagcata gaactcaaat gctg 1731 



<210> 2 

<211> 4 68 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Val Asp Thr Glu Ser Pro Leu Cys ' Pro Leu Ser Pro Leu Glu Ala 
15 10 15 

Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met Gly 
20 25 30 

Asn lie Gin Glu lie Ser Gin Ser lie Gly Glu Asp Ser Ser Gly Ser 
35 40 45 

Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser Asp 
50 55 60 



Gly Ser Val lie Thr Asp Thr Leu Ser Pro Ala Ser Ser Pro Ser Ser 
65 70 75 80 
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Val Thr Tyr Pro Val Val Pro Gly Ser Val Asp Glu Ser Pro Ser Gly 
85 90 95 

Ala Leu Asn lie Glu Cys Arg lie Cys Gly Asp Lys Ala Ser Gly Tyr 
100 105 110 



His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg 
115 120 125 

Thr lie Arg Leu Lys Leu Val Tyr Asp Lys Cys Asp Arg Ser Cys Lys 

130 135 . 140 

•*.*.. ':- . • / £ ,-■ . ■ • ■ ; 

He Gin Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys Arg Phe His Lys 

145 150 155 160 

Cys Leu Ser Val Gly Met Ser His Asn Ala He Arg Phe Gly Arg Met 
165 170 175 

Pro Arg Ser Glu Lys Ala Lys Leu^Lys Ala Glu He Leu Thr Cys Glu 
180 185 190 

His Asp He Glu Asp Ser Glu Thr Ala 'Asp Leu Lys Ser Leu " Ala Lys 
195 200 205 

Arg He Tyr Glu Ala Tyr Leu Lys Asn' Phe Asn Met Asn Lys Val Lys 
210 215 220 

Ala Arg Val He Leu Ser Gly Lys Ala Ser Asn Asn Pro Pro Phe Val 
225 230 235 240 

He His Asp Met Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val Ala 
245 250 255 

Lys Leu Val Ala Asn Gly He Gin Asn Lys Glu Ala Glu Val Arg He 
260* 265 270 



Phe His Cys Cys Gin Cys Thr Ser Val Glu Thr Val Thr Glu Leu Thr 
275 280 - 285 
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Glu Phe Ala Lys Ala lie Pro Gly Phe Ala Asn Leu Asp Leu Asn Asp 
290 295 300 

Gin Val Thr Leu Leu Lys Tyr Gly Val Tyr Glu Ala lie Phe Ala Met 
305 310 315 320 

Leu Ser Ser Val Met Asn Lys Asp Gly Met Leu Val Ala Tyr Gly Asn 
325 330 335 

Gly Phe He Thr Arg Glu Phe Leu Lys Ser Leu Arg Lys Pro Phe Cys 
340 345 350 

Asp He Met Glu Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala Leu 
355 360 365 

Glu Leu Asp Asp Ser Asp lie Ser Leu Phe Val Ala Ala He He Cys 

370 . .- :.r . • * -375. r 380 -. . : '.. 

Cys Gly Asp Arg Pro Gly Leu Leu Asn Val Gly His He Glu Lys Met 

385 390 ...... 395 - ,400 

Gin Glu Gly He Val His Val Leu Arg Leu His Leu Gin Ser Asn His 
405 410 415 

Pro Asp Asp He Phe Leu Phe Pro Lys Leu Leu Gin Lys Met Ala Asp 
420 425 430 

Leu Arg Gin Leu Val Thr Glu His Ala Gin Leu Val Gin He He Lys 
435 440 445 

Lys Thr Glu Ser Asp Ala Ala Leu His Pro Leu Leu Gin Glu He Tyr 
450 455 460 

Arg Asp Met Tyr 
465 



<210> 3 
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<211> 867 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS r - " ■ " ■ S 

<222> (1)..(867) 

<223> Residues 192-468, preceded by His tag 



<300> 

<308> Genbank/S7434 9 

<309> 1995-04-12 . 

<313> (192) . . (468) 



<400> 3 

atg aaa aaa ggt cat cat cat cat cat cat ggt gaa cac gac ctg aaa 48 
Met Lys Lys Gly His His His His His His Gly Glu His Asp Leu Lys 
1 5 10 15 

gat teg gaa act gca gac etc aaa tct ctg ggc aag aga ate cac gaa 96 
Asp Ser Glu Thr Ala Asp Leu Lys Ser Leu Gly Lys Arg lie His Glu 
20 25 30 

gee tac ctg aag aac ttc aac atg aac aag gtc aag gee egg gtc ata 144 
Ala Tyr Leu . Lys Asn Phe Asn Met Asn Lys Val Lys Ala Arg Val lie 
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35 40 45 



etc gcg gga aag acc age aac aac ccg cct ttt gtc ata cat gac atg 192 
Leu Ala Gly Lys Thr Ser Asn Asn Pro Pro Phe Val lie His Asp Met 
50 55 60 

gag acc ttg tgt atg gec gag aag acg ctt gtg gec aag atg gtg gec 240 
Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val Ala Lys Met Val Ala 
65 70 75 80 

aac ggc gtc gaa gac aaa gag gca gag gtc cga ttc ttc cac tgc tgc 288 
Asn Gly Val Glu Asp Lys Glu Ala Glu Val Arg Phe Phe His Cys Cys 
85 90 95 

cag tgc atg tec gtg gag acc gtc acg gag etc aca gaa ttt gec aag 336 
Gin cys Met Ser Val Glu Thr Val Thr Glu Leu Thr Glu Phe Ala Lys 
100 105 110 

get ate cca ggc ttt gca aac ttg gac ttg aac gac caa gtc acc ttg 384 
Ala lie Pro Gly Phe Ala Asn Leu Asp Leu Asn Asp Gin Val Thr Leu 
115 120 125 

eta aag tac ggt gtg tat gaa gec ate ttc acg atg ctg tec tec ttg 432 
Leu Lys Tyr Gly Val Tyr Glu Ala lie Phe Thr Met Leu Ser Ser Leu 
130 135 ' 140 

atg aac aaa gac ggg atg ctg ate gcg tac ggc aat ggc ttt ate aca 480 
Met Asn Lys Asp Gly Met Leu lie Ala Tyr Gly Asn Gly Phe lie Thr 
145 150 155 160 

cgc gag ttc ctt aag aac ctg agg aag ccg ttc tgt gac ate atg gaa 528 
Arg Glu Phe Leu Lys Asn Leu Arg Lys Pro Phe Cys Asp lie Met Glu 
165 170 175 

ccc aag ttt gac ttc get atg aag ttc aat gee tta gaa ctg gat gac 576 
Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala Leu Glu Leu Asp Asp 
180 185 190 

agt gac att tec ctg ttt gtg get get ata att tgc tgt gga gat egg 624 
Ser Asp lie Ser Leu Phe Val Ala Ala lie lie Cys Cys Gly Asp Arg 
195 200 205 

cct ggc ctt eta aac ata ggc tac att gag aag ttg cag gag ggg att 672 
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Pro Gly Leu Leu Asn lie Gly Tyr lie Glu Lys Leu Gin Glu Gly lie 
210 215 220 

gtg cac gtg ctt aag etc cac ctg cag age aac cat cca gat gac acc 720 
Val His Val Leu Lys Leu His Leu Gin Ser Asn His Pro Asp Asp Thr 
225 230 235 240 

ttc etc ttc cca aag etc ctt caa aaa atg gtg gac ctt egg cag ctg 768 
Phe Leu Phe Pro Lys Leu Leu Gin Lys Met Val Asp Leu Arg Gin Leu " 
245 250 255 



gtc acg gag cat gcg cag etc gta cag gtc ate aag aag acc gag tec 816 
Val Thr Glu His Ala Gin Leu Val Gin Val lie Lys Lys Thr Glu Ser 
260 265 270 

gac gca gcg ctg cac cca ctg ttg caa gag ate tac aga gac atg tac 864 
Asp Ala Ala Leu His Pro Leu Leu Gin Glu lie Tyr Arg Asp Met Tyr 
275 280 285 

taa 867 



<210> 4 

<211> 288 

<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Lys Lys Gly His His His His 
1 5 

Asp Ser Glu Thr Ala Asp Leu Lys 
20 

Ala Tyr Leu Lys Asn Phe Asn Met 
35 40 



His His Gly Glu His Asp Leu Lys 
10 15 

Ser Leu Gly Lys Arg He His Glu 
25 30 
Asn Lys Val. Lys Ala Arg Val lie 
45 
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Leu Ala Gly Lys Thr Ser Asn Asn Pro Pro Phe Val lie His Asp Met 
50 55 €0 

Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val Ala Lys Met Val Ala 
65 70 75 80 

Asn Gly Val Glu Asp Lys Glu Ala Glu Val Arg Phe Phe His Cys Cys 
85 90 95 

Gin Cys Met Ser Val Glu Thr Val Thr Glu Leu Thr Glu Phe Ala Lys 

100 105 110 

Ala lie Pro Gly Phe Ala Asn Leu Asp Leu Asn Asp Gin Val Thr Leu 

115 120 125 y 

Leu Lys Tyr Gly Val Tyr. Glu Ala lie Phe Thr Met Leu Ser „Ser Leu 
130 135 , 140 

Met Asn Lys Asp Gly Met Leu lie Ala Tyr Gly Asn Gly Phe lie Thr 
145 150 155 160 

Arg Glu Phe Leu Lys Asn Leu Arg Lys Pro Phe Cys Asp lie Met Glu 
165 170 175 

Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala Leu Glu Leu Asp Asp 
180 185 190 

Ser Asp lie Ser Leu Phe Val Ala Ala lie lie Cys Cys Gly Asp Arg 

195 200 205 J) 

Pro Gly Leu Leu Asn lie Gly Tyr lie Glu Lys Leu Gin Glu Gly lie 
210 215 220 

Val His Val Leu Lys Leu His Leu Gin Ser Asn His Pro Asp Asp Thr 
225 230 235 240 

Phe Leu Phe Pro Lys Leu Leu Gin Lys Met Val Asp Leu Arg Gin Leu 

245 250 255 

Val Thr Glu His Ala Gin Leu Val Gin Val He Lys Lys Thr Glu Ser 
260 265 270 
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Asp Ala Ala Leu His Pro Leu Leu Gin Glu He Tyr Arg Asp Met Tyr 
275 : 280 285 

<210> 5 
<211> 2961 
<212> DNA 

<213> Homo sapiens ' " 

<220> 

<221> CDS 

<222> (338) (1660) 

<300> 

<308> Genbank/L07592 

<309> 1993-04-27 

<313> (338) (1660). 

<400> 5 

gaattctgcg gagcctgcgg gacggcggcg ggttggcccg taggcagccg ggacagtgtt 60 

gtacagtgtt ttgggcatgc acgtgatact cacacagtgg cttctgctca ccaacagatg 120 

aagacagatg caccaacgag ggtctggaat ggtctggagt ggtctggaaa gcagggtcag 180 

atacccctgg aaaactgaag cccgtggagc aatgatctct acaggactgc ttcaaggctg 240 
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atgggaacca ccctgtagag gtccatctgc gttcagaccc agacgatgcc agagctatga 300 

ctgggcctgc aggtgtggcg ccgaggggag atcagcc atg gag cag cca cag gag 355 

Met Glu Gin Pro Gin Glu 
1 5 

gaa gcc cct gag gtc egg gaa gag gag gag aaa gag gaa gtg gca gag . 403 
Glu Ala Pro Glu Val Arg Glu Glu Glu Glu Lys Glu Glu Val Ala Glu 
10 15 20 

gca gaa gga gcc cca gag etc aat ggg gga cca cag cat gca ctt cct 451 
Ala Glu Gly Ala Pro Glu Leu Asn Gly Gly Pro Gin His Ala Leu Pro 
25 30 35 

tec age age tac aca gac etc tec egg age tec teg cca ccc tea ctg 499 
Ser Ser Ser Tyr Thr Asp Leu Ser Arg Ser Ser Ser Pro Pro Ser Leu 
40 45 50 

ctg gac caa ctg cag atg ggc tgt gac ggg gcc tea tgc ggc age etc 547 
Leu Asp Gin Leu Gin Met Gly Cys Asp Gly Ala Ser Cys Gly Ser Leu 
55 60 €5 70 

aac atg gag tgc egg gtg tgc ggg gac aag gca teg ggc ttc cac tac 595 
Asn Met Glu Cys Arg Val Cys Gly Asp Lys Ala Ser Gly Phe His Tyr 
75 80 85 

ggt gtt cat gca tgt gag ggg tgc aag ggc ttc ttc cgt cgt acg ate 643 
Gly Val His Ala Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Thr He 
90 95 100 

cgc atg aag ctg gag tac gag aag tgt gag cgc age tgc aag att cag 691 

Arg Met Lys Leu Glu Tyr Glu Lys Cys Glu Arg Ser Cys Lys He Gin } 
105 110 115 

aag aag aac cgc aac aag tgc cag tac tgc cgc ttc cag aag tgc ctg 739 
Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys Arg Phe Gin Lys Cys Leu 
120 125 130 



gca ctg ggc atg tea 
Ala Leu Gly Met Ser 
135 

get gag aag agg aag 
Ala Glu Lys Arg Lys 
155 



cac aac get ate cgt 
His Asn Ala He Arg 
140 

ctg gtg gca ggg ctg 
Leu Val Ala Gly Leu 
160 



ttt ggt egg atg ccg gag 787 
Phe Gly Arg Met Pro Glu 
145 150 
act gca aac gag ggg age 835 
Thr Ala Asn Glu Gly Ser 
165 
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cag tac aac cca cag gtg gcc gac ctg aag gcc ttc tec aag cac ate - 883 
Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys Ala Phe Ser Lys His lie 1 
170 175 180 

tac aat gcc tac ctg aaa aac ttc "aac atg acc aaa- aag -aag- gcc -cgc 931 
Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met Thr Lys Lys Lys Ala Arg 
185 190 195 

age ate etc acc ggc aaa gcc age cac acg gcg ccc ttt gtg ate cac 979 
Ser lie Leu Thr Gly Lys Ala Ser His Thr ' Ala -Pro~Phe Val" Ile-His ♦ " 
200 < . 205 210 

gac ate gag aca ttg ; tgg ' cag ~gca gag aag" ggg ctg gtg tgg aag cag 1027 
Asp He Glu Thr Leu Trp Gin Ala Glu Lys Gly Leu Val Trp Lys Gin 
215 220 225 230 

ttg gtg aat ggc ctg'cct ccc tac aag gag atc : age gtg cac gtc ttc 1075 
Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu lie Ser 7 Val His-Val-Phe ' 
235 240 245 

tac cgc tgc cag tgc acc aca gtg gag acc gtg egg gag etc act gag 1123 
Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr Val Arg Glu Leu Thr Glu 
250 255 260 

ttc gcc aag age ate ccc age ttc age age etc ttc etc aac gac cag ' 1171 
Phe Ala Lys Ser He Pro Ser Phe Ser Ser Leu Phe Leu Asn Asp Gin 
265 270 275 

gtt acc ctt etc aag tat ggc gtg cac gag gcc ate ttc gcc atg ctg 1219 
Val Thr Leu Leu Lys Tyr Gly Val His Glu Ala He Phe Ala Met Leu 
280 285 290 

gcc tct ate gtc aac aag gac ggg ctg ctg gta gcc aac ggc agt ggc 1267 
Ala Ser He Val Asn Lys Asp Gly Leu Leu Val Ala Asn Gly Ser Gly 
295 300 305 310 

ttt gtc acc cgt gag ttc ctg cgc age etc cgc aaa ccc ttc agt gat . 1315 
Phe Val Thr Arg Glu Phe Leu Arg Ser Leu Arg Lys Pro Phe Ser Asp 
. 315 320 325 

ate att gag cct aag ttt gaa ttt get gtc aag ttc aac gcc ctg gaa 1363 
He He Glu Pro Lys Phe Glu Phe Ala Val Lys Phe Asn Ala Leu Glu 
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330 335 

ctt gat gac agt gac ctg gcc eta ttc att gcg gec ate att ctg tgt 1411 
Leu Asp Asp Ser Asp Leu Ala Leu Pbe He Ala Ala He He Leu Cys 
345 350 355 

gga gac egg cca ggc etc atg aac gtt cca egg gtg gag get ate cag 1459 
Gly Asp Arg Pro Gly Leu Met Asn Val Pro Arg Val Glu Ala lie Gin 
360 365 370 



PCT/US02/03278 

340 



gac ace ate ctg cgt gcc etc gaa ttc cac ctg cag gcc aac cac cct 1507 
Asp Thr He Leu Arg Ala Leu Glu Pbe His Leu Gin Ala Asn His Pro 
375 380 385 390 

gat gcc cag tac etc ttc ccc aag ctg ctg cag aag atg get gac ctg , 1555 
Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu Gin Lys Met Ala Asp Leu 
395 400 405 

egg caa ctg gtc ace gag cac gcc cag atg atg cag egg ate aag aag 1603 
Arg Gin Leu Val Thr Glu. His Ala Gin . Met Met . Gin Arg lie Lys Lys 
410 415 420 

ace gaa acc gag acc teg ctg cac cct ctg etc cag gag ate tac aag 1651 
Thr Glu Thr Glu Thr Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys 
425 430 435 

gac atg tac taaeggegge acccaggcct ccctgcagac tccaatgggg 1700 
Asp Met Tyr 
440 

ccagcactgg aggggcccac ccacatgact tttccattga ccagctctct tcctgtcttt 1760 

gttgtctccc tctttctcag ttcctctttc ttttctaatt cctgttgctc tgtttcttcc 1820 

tttctgtagg tttctctctt cccttctccc ttctcccttg ccctcccttt ctctctccta 1880 

tccccacgtc tgtcctcctt tcttattctg tgagatgttt tgtattattt caccagcagc 1940 

atagaacagg acctctgctt ttgcacacct tttccccagg agcagaagag agtgggcctg 2000 

ccctctgccc catcattgea cctgcaggct taggtcctca cttctgtctc ctgtcttcag 2060 

agcaaaagac ttgagecate caaagaaaca ctaagctctc tgggcctggg ttccagggaa 2120 
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ggctaagcat ggcctggact gactgcagcc ccctatagtc atggggtccc tgctgcaaag 2180 

gacagtggca gaccccggca gtagagccga gatgcctccc caagactgtc attgcccctc 2240 

cgatcgtgag gccacccact gacccaatga tcctctccag cagcacacct cagccccact 2300 

gacacccagt gtccttccat cttcacactg gtttgccagg ccaatgttgc tgatggcccc 2360 

tccagcacac acacataagc actgaaatca ctttacctgc aggcaccatg cacctccctt 2420 

. • - c . . : f - 

ccctccctga ggcaggtgag aacccagaga gaggggcctg caggtgagca ggcagggctg 24 80 

\ 

ggccaggtct ccggggaggc aggggtcctg caggtcctgg -tgggtcagcc cagcacctcg ' 2540 

cccagtggga gcttcccggg ataaactgag cctgttcatt ctgatgtcca tttgtcccaa 2600 

tagctctact gccctcccct tcccctttac tcagcccagc tggccaccta gaagtctccc 2660 

tgcacagcct ctagtgtccg gggaccttgt gggaccagtc ccacaccgct ggtccctgcc 2720 

ctcccctgct cccaggttga ggtgcgctca cctcagagca gggccaaagc acagctgggc 27 80 

atgccatgtc tgagcggcgc' agagccctcc aggcctgcag gggcaagggg ctggctggag 2840 

tctcagagca cagaggtagg agaactgggg ttcaagccca ggcttcctgg gtcctgcctg . 2900 

gtcctccctc ccaaggagcc attctatgtg actctgggtg gaagtgccca gcccctgcct 2960 

g 2961 

<210> 6 

<211> • 4 41 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Glu Gin Pro Gin Glu Glu Ala Pro Glu Val Arg Glu Glu. Glu Glu 
1 5 10 15 
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Lys Glu Glu Val Ala Glu Ala Glu Gly Ala Pro Glu Leu Asn Gly Gly 
20 25 30 

Pro Gin His Ala Leu Pro Ser Ser Ser Tyr Thr Asp Leu Ser Arg Ser 
35 40 45 

Ser Ser Pro Pro Ser Leu Leu Asp Gin Leu Gin Met Gly Cys Asp Gly 
50 55 60 

Ala Ser Cys Gly Ser Leu Asn Met Glu Cys Arg Val Cys Gly Asp Lys 
65 70 75 80 

Ala Ser Gly Phe His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly °^ 

85 ' : ' 90 95 

Phe Phe Arg Arg Thr lie Arg Met Lys Leu Glu Tyr Giu Lys Cys Glu 

100. _ _105 110 . _ 

Arg Ser Cys Lys . lie Gin Lys .Lys .Asn Arg Asn Lys Cys Gin Tyr Cys 
115 120 125 

Arg Phe Gin Lys Cys Leu Ala Leu Gly Met Ser His Asn Ala lie Arg 
130 135 140 



Phe Gly Arg Met Pro Glu Ala Glu Lys Arg Lys Leu Val Ala Gly Leu 
145 150 155 160 



Thr Ala Asn Glu Gly Ser Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys 
165 170 175 

Ala Phe Ser Lys His He Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met 
180 185 190 

Thr Lys Lys Lys Ala Arg Ser He Leu Thr Gly Lys Ala Ser His Thr 
195 200 205 

Ala Pro Phe Val He His Asp He Glu Thr Leu Trp Gin Ala Glu Lys 
210 215 220 



) 
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Gly Leu Val Trp Lys Gin Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu 
225 230 235 240 

lie Ser Val His Val Phe Tyr Arg Cys Gin Cys Thr Thx Val Glu Thx 
245 250 255 

Val Arg Glu Leu Thr Glu Phe Ala Lys Ser lie Pro Ser Phe Ser Ser 
260 265 270 

Leu Phe Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly Val His Glu 
275 280 285 

Ala lie Phe Ala Met Leu Ala Ser lie Val Asn Lys Asp Gly Leu Leu 
290 295 300 

Val Ala Asn Gly Ser Gly Phe Val Thr Arg Glu Phe Leu Arg Ser Leu 
305 310 315 320 

Arg Lys Pro Phe Ser Asp lie He Glu Pro Lys Phe Glu Phe Ala Val 
325 330 335 - 



Lys Phe Asn Ala Leu Glu Leu Asp Asp Ser Asp Leu Ala Leu Phe He 
340 345 350 

Ala Ala He lie Leu Cys Gly Asp Arg Pro Gly Leu Met Asn Val Pro 
355 360 365 

Arg Val Glu Ala He Gin Asp Thr He Leu Arg Ala Leu Glu Phe His 

370 . 375 380 . . 



Leu Gin Ala Asn His Pro Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu 
385 390 395 400 



Gin Lys Met Ala Asp Leu Arg Gin Leu Val Thr Glu His Ala Gin Met 
405 410 415 
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Met Gin Arg lie Lys Lys Thx Glu Thr Glu Thr Ser Leu His Pro Leu 
420 425 430 

Leu Gin Glu lie Tyr Lys Asp Met Tyr 
435 440 

<210> 7 

<211> 825 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (1)..<825) 



<400> 7 



cag tac aac cca cag gtg gcc gac ctg aag gcc ttc tec aag cac ate 48 
Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys Ala Phe Ser Lys His lie 
15 10 15 

tac aat gcc tac ctg aaa aac ttc aac atg acc aaa aag aag gcc cgc 96 
Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met Thx Lys Lys Lys Ala Arg 
20 25 30 

age ate etc acc ggc aaa gcc age cac acg gcg ccc ttt gtg ate cac 144 
Ser He Leu Thr Gly Lys Ala Ser His Thr Ala Pro Phe Val lie His 
35 40 45 

gac ate gag aca ttg tgg cag gca gag aag ggg ctg gtg tgg aag cag 192 
Asp He Glu Thr Leu Trp Gin Ala Glu Lys Gly Leu Val Trp Lys Gin 



) 
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50 55 60 

ttg gtg aat ggc ctg cct ccc tac aag gag ate age gtg cac gtc-ttc 240 

-Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu lie Ser Val His Val Phe 
65 70 75 80 

tac cgc tgc cag tgc acc aca gtg gag acc gtg egg gag etc act gag 288 

Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr Val Arg Glu Leu Thr Glu 
85 '90 95 

ttc gec aag age ate ccc age ttc age age etc ttc etc aac gac cag 336 

Phe Ala Lys Ser lie Pro Ser Phe Ser Ser Leu Phe Leu Asn Asp Gin 
100 105 110 

gtt acc ctt etc aag tat ggc gtg cac gag gee ate ttc gee atg ctg 384 

Val Thr Leu Leu Lys Tyr Gly Val His Glu Ala lie Phe Ala Met Leu 
115 120 125 

gee tct ate gtc aac aag gac ggg ctg ctg gta gec aac ggc agt ggc 432 

Ala Ser lie Val Asn Lys Asp Gly Leu Leu Val Ala Asn Gly Ser Gly 

130 135 140 

ttt gtc acc cgt gag ttc ctg cgc age etc cgc aaa ccc ttc agt gat 480 

Phe Val Thr Arg Glu Phe Leu Arg Ser Leu Arg Lys Pro Phe Ser Asp 
145 150 155 160 

ate att gag cct aag ttt gaa ttt get gtc aag ttc aac gee ctg gaa 528 

lie lie Glu Pro Lys Phe Glu Phe Ala Val Lys Phe Asn Ala Leu Glu 

165 . 170 175 

ctt gat gac agt gac ctg gee eta ttc att gcg gec ate att ctg tgt 57 6 

Leu Asp Asp Ser Asp Leu Ala Leu Phe lie Ala Ala lie He Leu Cys 
180 185 190 

gga gac egg cca ggc etc atg aac gtt cca egg gtg gag get ate cag 624 

Gly Asp Arg Pro Gly Leu Met Asn Val Pro Arg Val Glu Ala He Gin 
195 200 205 

gac acc ate ctg cgt gec etc gaa ttc cac ctg cag gec aac cac cct 672 

Asp Thr He Leu Arg Ala Leu Glu Phe His Leu Gin Ala Asn His Pro 

210 215 220 

gat gec cag tac etc ttc ccc aag ctg ctg cag aag atg get gac ctg 720 

Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu Gin Lys Met Ala Asp Leu 
225 230 235 240 
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egg caa ctg gtc acc gag cac gec cag atg atg cag egg ate aag aag 768 
Arg Gin Leu Val Thr Glu His Ala Gin Met Met Gin Arg He Lys Lys 
245 250 255 

acc gaa acc gag acc teg ctg cac cct ctg etc cag gag ate tac aag 816 
Thr Glu Thr Glu Thr Ser Leu His Pro Leu Leu Gin Glu He ; Tyr Lys 
260 265 270 

gac atg tac 825 
Asp Met Tyr 

275 . ........ 



<210> 8 

<211> 275 

<212> PRT 

<213> Homo sapiens 



<40O> 8 

Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys Ala Phe Ser Lys His He 
1 5 - 10 15 

Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met Thr Lys Lys Lys Ala Arg 
20 25 30 

Ser He Leu Thr Gly Lys Ala Ser His Thr Ala Pro Phe Val He His 
35 40 45 

Asp He Glu Thr Leu Trp Gin Ala Glu Lys Gly Leu Val Tip Lys Gin 
50 55 60 

Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu He Ser Val His Val Phe 
65 70 75 80 
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Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr Val Arg Glu Leu Thr Glu 
85 90 95 

Phe Ala Lys Ser lie Pro Ser Phe Ser Ser Leu Phe Leu Asn Asp Gin 
100 105 110 

Val Thr Leu Leu Lys Tyr Gly Val His Glu Ala lie Phe Ala Met Leu 
115 120 125 

Ala Ser lie Val Asn Lys Asp Gly Leu Leu Val Ala Asn Gly Ser Gly 
130 135 140 

Phe Val Thr Arg Glu Phe Leu Arg Ser Leu Arg Lys Pro Phe Ser Asp 
145 150 155 160 

lie lie Glu Pro Lys Phe Glu Phe Ala Val Lys Phe Asn Ala Leu Glu 
165 170 175 

Leu Asp Asp Ser Asp Leu Ala Leu Phe lie Ala Ala lie lie Leu Cys 
180 185 190 

Gly Asp Arg Pro Gly Leu Met Asn Val Pro Arg Val Glu Ala lie Gin 
195 200 205 

Asp Thr lie Leu Arg Ala Leu Glu Phe His Leu Gin Ala Asn His Pro 
210 215 220 

Asp Ala Gin Tyr Leu Phe Pro Lys Leu . Leu Gin Lys Met Ala Asp Leu 
225 230 235 240 



Arg Gin Leu Val Thr Glu His Ala Gin Met Met Gin Arg He Lys Lys 
245 250 255 

Thr Glu Thr Glu Thr Ser Leu His Pro Leu Leu Gin Glu He Tyr Lys 
260 265 270 

Asp Met Tyr 
275 
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<210> 9 
<211> 11 
<212> PRT 

<213> Artificial Purification Sequence 
<400> 9 

Met Lys Lys Gly His His His His His His Gly 
15 10 
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